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ABSTRACT 

This paper formulates a new approach to the study of chaos in discrete dynamical systems based on 
the notions of inverse ill-posed problems, set-valued mappings, generalized and multivalued inverses . 
Graphical convergence of a net of functions in an extended multifunction space IjSengupta and 



120001) . and the topological theory of convergence. Order, chaos, and complexity are described as distinct 
components of this unified mathematical structure that can be viewed as an application of the theory of 
convergence in topological spaces to increasingly nonlinear mappings, with the boundary between order 
and complexity in the topology of graphical convergence being the region in Multi(X) that is susceptible 
to chaos. The paper u ses results from the discretized spectral approximation in neutron transport theory 
()SenguDtaLll988lfl995l) and concludes that the numerically exact results obtained by this approximation of 
the Case singular eigenfunction solution is due to the graphical convergence of the Poisson and conjugate 
Poisson kernels to the Dirac delta and the principal value multifunctions respectively. In Multi(X), the 
continuous spectrum is shown to reduce to a point spectrum, and we introduce a notion of latent chaotic 
states to interpret superposition over generalized eigenfunctions. Along with these latent states, spectral 
theory of nonlinear operators is used to conclude that nature supports complexity to attain efficiently a 
multiplicity of states that otherwise would remain unavailable to it. 

Keywords: chaos, complexity, ill-posed problems, graphical convergence, topology, multifunctions. 



Prologue 

1. Generally speaking, the analysis of chaos is extremely difficult. While a general definition for chaos 
applicable to most cases of interest is still lacking, mathematicians agree that for the special case of iteration 
of transformations there are three common characteristics of chaos: 

1. Sensitive dependence on initial conditions, 

2. Mixing, 

3. Dense periodic points. 



Peitgen et aP ()l992h 



2. The study of chaos is a part of a larger program of study of so-called "strongly" nonlinear system. ■ ■ ■ 
Linearity means that the rule that determines what a piece of a system is going to do next is not influenced by 
what it is doing now. More precisely this is intended in a differential or incremental sense: For a linear spring, 
the increase of its tension is proportional to the increment whereby it is stretched, with the ratio of these 
increments exactly independent of how much it has already been stretched. Such a spring can be stretched 
arbitrarily far ■ ■ ■ . Accordingly no real spring is linear. The mathematics of linear objects is particularly 
felicitous. As it happens, linear objects enjoy an identical, simple geometry. The simplicity of this geometry 
always allows a relatively easy mental image to capture the essence of a problem, with the technicality, growing 
with the number of parts, basically a detail. The historical prejudice against nonlinear problems is that no so 
simple nor universal geometry usually exists. 



Mitchell Feigenbaum's Foreword (pp 1-7) in lPeitgen et al. 
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3. The objective of this symposium is to explore the impact of the emerging science of chaos on various 
disciplines and the broader implications for science and society. The characteristic of chaos is its universality 
and ubiquity. At this meeting, for example, we have scholars representing mathematics, physics, biology, 
geophysics and geophysiology astronomy, medicine, psychology, meteorology, engineering, computer science, 
economics and social sciences^ . Having so many disciplines meeting together, of course, involves the risk that 
we might not always speak the same language, even if all of us have come to talk about "chaos". 

Opening address of Heitor Gurgulino de Souza, Rector United Nations University, Tokyo 

de Soiizai fim^) 



4. The predominant approach (of how the different fields of science relate to one other) is reductionist: 
Questions in physical chemistry can be understood in terms of atomic physics, cell biology in terms of how 
biomolecules work ■ ■ ■ . We have the best of reasons for taking this reductionist approach: it works. But 
shortfalls in reductionism are increasingly apparent (and) there is something to be gained from supplementing 
the predominantly reductionist approach with an integrative agenda. This special section on complex systems 
is an initial scan (where) we have taken a "complex system" to be one whose properties are not fully explained 
by an understanding of its component parts. Each Viewpoint author^ was invited to define "complex" as it 
applied to his or her discipline. 

Gallagher and Appenzeller ( 19991 ) 



5. One of the most striking aspects of physics is the simplicity of its laws. Maxwell's equations, 
Schroedinger's equations, and Hamilton mechanics can each be expressed in a few lines. ■ ■ ■ Everything 
is simple and neat except, of course, the world. Every place we look outside the physics classroom we see 
an world of amazing complexity. ■ ■ ■ So why, if the laws are so simple, is the world so complicated? To us 
complexity means that we have structure with variations. Thus a living organism is complicated because it 
has many different working parts, each formed by variations in the working out of the same genetic coding. 
Chaos is also found very frequently. In a chaotic world it is hard to predict which variation will arise in a given 
place and time. A complex world is interesting because it is highly structured. A chaotic world is interesting 
because we do not know what is coming next. Our world is both complex and chaotic. Nature can produce 
complex structures even in simple situations and obey simple laws even in complex situations. 



Goldenfeld and Kadanoi 



6. Where chaos begins, classical science stops. For as long as the world has had physicists inquiring 
into the laws of nature, it has suffered a special ignorance about disorder in the atmosphere, in the turbulent 
sea, in the fluctuations in the wildlife populations, in the oscillations of the heart and the brain. But in the 
1970s a few scientists began to find a way through disorder. They were mathematicians, physicists, biologists, 
chemists ■ ■ ■ (and) the insights that emerged led directly into the natural world: the shapes of clouds, the 
paths of lightning, the microscopic intertwining of blood vessels, the galactic clustering of stars. ■ ■ ■ Chaos 
breaks across the lines that separate scientific disciplines, (and) has become a shorthand name for a fast 
growing movement that is reshaping the fabric of the scientific establishment. 



Gleicd (ll98'H ) 



7. order — > complexity — > chaos. 

WaJdronI (jl992h 



8. Our conclusions based on these examples seems simple: At present chaos is a philosophical term, not 
a rigorous mathematical term. It may be a subjective notion illustrating the present day limitations of the 
human intellect or it may describe an intrinsic property of nature such as the "randomness" of the sequence of 
prime numbers. Moreover, chaos may be undecidable in the sense of Godel in that no matter what definition 
is given for chaos, there is some example of chaos which cannot be proven to be chaotic from the definition. 



partial listing of papers is as follows: Chaos and Politics: Application of nonlinear dynamics to socio- 
political issues; Cfiaos in Society: Reflections on the impact of chaos theory on sociology; Chaos in neural networks; 
The impact of chaos on mathematics; The impact of chaos on physics; The impact of chaos on economic theory; 
The impact of chaos on engineering; The impact of chaos on biology; Dynamical disease: and The impact of 
nonlinear dynamics and chaos on cardiology and medicine. 

^The eight Viewpoint articles are titled: Simple Lessons from Complexity; Complexity in Chemistry; Complex- 
ity m Biological Signaling Systems; Complexity and the Nervous System; Complexity, Pattern, and Evolutionary 
Trade-Offs in Animal Aggregation; Complexity in Natural Landform Patterns; Complexity and Climate and Com- 
plexity and the Economy. 
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Brown and Chnal hmd ) 



9. My personal feeling is that the definition of a "fractal" should be regarded in the same way as the 
biologist regards the definition of "life". There is no hard and fast definition, but just a list of properties 
characteristic of a living thing ■ ■ ■ . Most living things have most of the characteristics on the list, though 
there are living objects that are exceptions to each of them. In the same way, it seems best to regard a fractal 
as a set that has properties such as those listed below, rather than to look for a precise definition which will 
certainly exclude some interesting cases. 



Falconeri (|1 



10. Dynamical systems are often said to exhibit chaos without a precise definition of what this means. 

RobinsonI 



1. Introduction 

The purpose of this paper is to present an unified, self-contained mathematical structure and 
physical understanding of the nature of chaos in a discrete dynamical system and to suggest 
a plausible explanation of why natural systems tend to be chaotic. The somewhat extensive 
quotations with which we begin above, bear testimony to both the increasingly significant — 
and perhaps all-pervasive — role of nonlinearity in the world today as also our imperf ect state of 
unde rstanding of its m anifestations. The list of papers a t both the UN Conference ( de Souzal 



1993) and in Science dCallagher and Appenzelleil . ^99^1) is noteworthy if only to justify the 



observation of Gleickl (jl987^ that "chaos seems to be everywhere". Even as everybody appears 



to be finding chaos and complexity in all likely and unlikely places, and possibly because of it, it 
is necessary that we have a clear mathematically-physical understanding of these notions that 
are supposedly reshaping our view of nature. This paper is an attempt to contribute to this goal. 
To make this account essentially self-contained we include here, as far as this is practicable, the 
basics of the background material needed to understand the paper in the form of Tutorials and 
an extended Appendix. 

The paradigm of chaos of the kneading of the dou gh is considered to provide an intuitive 
basis of the mathematics of chaos ( Peiteen et al.l . 1992h . and one of our fundamental objectives 



here is to recount the mathematical frai nework of this pr ocess in terms of the theory of ill-posed 
problems arising from n qn-injectivity dSenguptal. Il997ll . maximal ill-posedness, and graphical 
convergence of functions ()Sengupta and Ravi . l2nnnl v A natural mathematical formulation of the 



kneading of the dough in the form of stretch- cut- and-paste and stretch- cut- and- fold operations 
is in the ill-posed problem arising from the increasing non-injectivity of the function / modeling 
the kneading operation. 

Begin Tutoriall: Functions and Multifunctions 

A relation, or correspondence, between two sets X and Y, written "M: X Y, is basically a 
rule that associates subsets of X to subsets of Y; this is often expressed as {A,B) £ M where 
A C X and B cY and {A,B) is an ordered pair of sets. The domain 

V{M) = {A cX:(3Ze M){ttx{Z) = A)} 



and range 



7^(M) = {B CY:^Z€ M)(7ry(Z) = B)} 



of M are respectively the sets of X which under M corresponds to sets in Y; here ttx and vry 
are the projections of Z on X and Y respectively. Equivalently, V^M.) = {x £ X : M(x) / 0} 
and TZ(M.) = Ua'SfCX) ^i^)- The inverse M~ of M is the relation 

M~ = {{B,A): {A,B)eM} 
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so that M~ assigns A to iff M assigns B to A. In general, a relation may assign many 
elements in its range to a single element from its domain; of especial significance are functional 
relations that can assigns only a unique element in TZ{f) to any element in T>{f). Fig. ^ 
illustrates the distinction between arbitrary and functional relations M and /. This difference 
between functions (or maps) and multifunctions is basic to our developments and should be 
fully understood. Functions can again be classified as injections (or 1 : 1) and surjections (or 
onto), f : X ^ Y \s said to be injective (or one-to-one) if xi X2 ^ fi^i) 7^ f{^2) for all 
xi,X2 G X, while it is surjective (or onto) if y = f{X). f is bijective if it is both 1 : 1 and onto. 




Figure 1: Functional and non- functional relations between two sets X and Y: while / and g are functional 
relations, M is not. In figure (a) / and g are both injective and surjective (that is they are bijective), in (b) g is 
bijective but / is only injective and f~^{{y2}) 0, in (c) / is not 1 : 1, g is not onto, while in (d) M is not a 
function but is a multifunction. 

Associated with a function / : X — > 1" is its inverse : Y 5 Tl{f) — > X that exists on 7^(/) 
iff / is injective. Thus when / is bijective, f~^{y) := {x ^ X : y = f{x)} exists for every y £Y; 
infact / is bijective iff /~^({y}) is a singleton for each y £ Y. Non-injective functions are not at 
all rare; if anything, they are very common even for linear maps and it would be perhaps safe 
to conjecture that they are overwhelmingly predominant in the nonlinear world of nature. Thus 
for example, the simple linear homogeneous differential equation with constant coefficients of 
order n > 1 has n linearly independent solutions so that the operator Z)"" of D^{y) = has a 
n-dimensional null space. Inverses of non-injective, and in general non-bijective, functions will 
be denoted by /~. If / is not injective then 

A(irf{A) = s^t{A) 
where saX{A) is the saturation of A Q X induced by f; if / is not surjective then 

f_f_{B) ■.= Bf] f{X) C B. 

'^Wc do not distinguish between a relation and its graph although technically they are different objects. 
Thus although a functional relation, strictly speaking, is the triple {X, /, Y) written traditionally as 
f : X Y , we use it synonymously with the graph / itself. Parenthetically, the word functional in this 
work is not necessarily employed for a scalar-valued function, but is used in a wider sense to distinguish 
between a function and an arbitrary relation (that is a multifunction). Formally, whereas an arbitrary 
relation from X to F is a subset oi X xY, a functional relation must satisfy an additional restriction that 
requires yi ~ y2 whenever (x,yi) G / and (x,y2) G /• In this subset notation, {x,y) £ f ^ y = f(x). 
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li A = sat(^), then A is said to be saturated, and B C TZ{f) whenever ff~{B) = B. Thus 
for non-injective /, /"/ is not an identity on X just as //~ is not ly if / is not surjective. 
However the set of relations 

//"/ = /, /"//" = r (1) 

that is always true will be of basic significance in this work. Following are some equivalent 
statements on the injectivity and surjectivity of functions / : X — > y. 

(Injec) / is 1 : 1 44> there is a function /l : Y ^ X called the left inverse of /, such that 
/l/ = Ix ^ ^ = rf{A) for ah subsets A of X^f{i^ Ai) = Q /(^^). 

(Surjec) / is onto 44> there is a function /r -.Y ^ X called the right inverse of /, such that 
ff^ = \y^B = ff~{B) for all subsets B of Y. 

As we are primarily concerned with non-injectivity of functions, saturated sets generated by 
equivalence classes of / will play a significant role in our discussions. A relation E£ on a set X 
is said to be an equivalence relation if it is^ 

(ERl) Reflexive: (Vx G X){x£x). 

(ER2) Symmetric: (Vx,y G X){x£y =^ y£x). 

(ER3) Transitive: (Vx, y, z G X){x£y A y£z =^ x£z). 

Equivalence relations group together unequal elements xi 7^ X2 of a set as equivalent according 
to the requirements of the relation. This is expressed as xi ~ X2 (mod £) and will be represented 
here by the shorthand notation xi ~£: X2, or even simply as xi ~ X2 if the specification of £ is 
not essential. Thus for a noninjective map if /(xi) = /(X2) for xi 7^ X2, then xi and X2 can 
be considered to be equivalent to each other since they map onto the same point under /; thus 
xi ~/ X2 /(xi) = /(X2) defines the equivalence relation ~j induced by the map /. Given an 
equivalence relation ~ on a set X and an element x G X the subset 

[x] =^ {y G X : ?/ ~ x} 

is called the equivalence class of x; thus x ~ y 44> [x] = [y]. In particular, equivalence classes 
generated by / : X — > y, [x]f = {xa G X : f{xa) = f{x)}, will be a cornerstone of our 
analysis of chaos generated by the iterates of non-injective maps, and the equivalence relation 
~j:= {(x, y) : /(x) = f{y)} generated by / is uniquely defined by the partition that / induces on 
X. Of course as x ~ x, x G [x]. It is a simple matter to see that any two equivalence classes are 
either disjoint or equal so that the equivalence classes generated by an equivalence relation on 
X form a disjoint cover of X. The quotient set of X under ~, denoted by X/ ~ := {[x] : x G X}, 
has the equivalence classes [x] as its elements; thus [x] plays a dual role either as subsets of X 
or as elements of X/ ~. The rule x 1— > [x] defines a surjective function Q : X — > X/ ~ known as 
the quotient map. 

Example 1.1. Let 

S' = {{x,y)eR^):x' + y^ = l} 
be the unit circle in M?. Consider X = [0, 1] as a subspace of M, define a map 

q:X^S^, SI — > (cos 27rs, sin 27rs), s G X, 

from M to M^, and let ~ be the equivalence relation on X 

s ~ t (s = t) V (s = 0, t = 1) V (s = 1, f = 0). 

*An useful alternate way of expressing these properties for a relation M on X are 

(ERl) M is reflexive iff Ix C M 
(ER2) M is symmetric iff M = M~ 
(ER3) M is transitive iff M o M C M, 

with M an equivalence relation only if MoM = M, where ior M. C X xY and 'N CYx Z, the composition 
J^oM:= {{x,z) GX X Z:{3ye Y) {{x,y) G M) A {{y,z) G J^)} 
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If we bend X around till its ends touch, the resulting circle represents the quotient set Y = X/ ~ 
whose points are equivalent under ~ as follows 



[0] = {0, 1} = [1], [s] = {s} for ah s G (0, 1). 




Figure 2: The quotient map Q 

Thus q is bijective for s G (0, 1) but two-to-one for the special values s = and 1, so that 
for s,t £ X, 

s ~ t q{s) = q{t). 
This yields a bijection h : Xj ~ — > 5^ such that 

q = ho Q 

defines the quotient map Q : X — > Xj ~ by = for all s G [0, 1]. The situation is 

illustrated by the commutative diagram of Fig. |21 that appears as an integral component in 
a different and more general context in Sec. 2. It is to be noted that commutativity of the 
diagram implies that if a given equivalence relation ~ on X is completely determined by q that 
associates the partitioning equivalence classes in X to unique points in 5^, then ~ is identical 
to the equivalence relation that is induced by Q on X. Note that a larger size of the equivalence 
classes can be obtained by considering X = M+ for which s~t44>|s — t| G Z+. ■ 

End Tutoriall 

One of the central concepts that we consider and employ in this work is the inverse /~ of a 
nonlinear, non-injective, function /; here the equivalence classes \x\j = f~ f{x) oi x £ X are 
the saturated subsets of X that partition X. While a detailed treatment of this question in the 
form of the non-linear ill-posed problem and its solution is given in Sec. 2 lSenguntal ()l997lV it is 
sufficient to point out here from Figs. E^c) and^^d), that the inverse of a noninjective function is 
not a function but a multifunction while the inverse of a multifunction is a noninjective function. 
Hence one has the general result that 

/ is a non injective function <^=^ /~ is a multifunction. (2) 
/ is a multifunction <^=^ /~ is a non injective function 

The inverse of a multifunction M.: X y is a generalization of the corresponding notion for 
a function / : X — > y such that 

M^{y) =^{x G X: ?/ G M{x)} 

leads to 

M~{B) = {x G X : JV[{x) n ^ / 0} 
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for any B QY, while a more restricted inverse that we shah not be concerned with is given as 
M+(5) = {x e X: M(x) C B}. Obviously, M+{B) C M~{B). A multifunction is injective if 
xi ^ X2 ^ M(xi) Pi M(x2) = 0, and in common with functions it is true that 



VaGD / aeD) 



and where B is an index set. The following illustrates the difference between the two inverses 
of M. Let X be a set that is partitioned into two disjoint M- invariant subsets Xi and X2. If 
X G Xi (or X G X2) then M(x) represents that part of Xi (or of X2 ) that is realized immediately 
after one application of M, while M~(x) denotes the possible precursors of x in Xi (or of X2) 
and 'M'^(B) is that subset of X whose image lies in B for any subset B C X. 

In this work the multifunctions we are explicitly concerned with arise as the inverses of 
non-injective functions. 

The second major component of our theory is the graphical convergence of a net of functions 
to a multifunction. In Tutorial2 below, we replace for the sake of simplicity and without loss 
of generality, the net (which is basically a sequence where the index set is not necessarily the 
positive integers; thus every sequence is a net but the family^ indexed, for example, by Z, the 
set of all integers, is a net and not a sequence) with a sequence and provide the necessary 
background and motivation for the concept of graphical convergence. 



Begin Tutorial2: Convergence of Functions 

This Tutorial reviews the inadequacy of the usual notions of convergence of functions either 
to limit functions or to distributions and suggests the motivation and need for introduction of 
the notion of graphical conve rgence of functions to multifunctions. Here, we follow closely the 



exposition of iKorevaan ()l968l ). and use the notation {fk)k=i to denote real or complex valued 
functions on a bounded or unbounded interval J . 

A sequence of piecewise continuous functions {fk)kLi is said to converge to the function /, 
notation — > /, on a bounded or unbounded interval 

(1) Pointwise if 

fk{x) — > f{x) for all x £ J, 

that is: Given any arbitrary real number e > there exists a K £ 'N that may depend on x, 
such that \ fk{x) — f{x)\ < e for all k > K. 

(2) Uniformly if 

sup \ f (x) - fk{x)\ — >0 ask — > 00, 

that is: Given any arbitrary real number e > there exists a K gN, such that sup^^^j |/fc(x) — 
f{x)\ < e for aU k>K. 

(3) In the mean of order p > 1 if \ f{x) — fk{x)\^ is integrable over J for each k 



|/(x)-/fc(x)|P^O as A; ^00. 



J 



^ A function % : D ^ X will be called a family in X indexed by D when reference to the domain D is 
of interest, and a net when it is required to focus attention on its values in X. 

""Observe that it is not being claimed that / belongs to the same class as {fk)- This is the single most 
important cornerstone on which this paper is based: the need to "complete" spaces that are topologically 
"incomplete". The classical high-school example of the related problem of having to enlarge, or extend, 
spaces that are not big enough is the solution space of algebraic equations with real coefficients like 
a;2 + 1 = 0. 
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For p = 1, this is the simple case of convergence in the mean. 

(4) In the mean m-integrally if it is possible to select indefinite integrals 



^1 



^1 



fk "^hx) = TTkix) + I dxi I dX2-- - I dXmfkiXm) 

and 

f(-"^){x) = 7r{x) + J dxi J dx2---J dxmfixr, 
such that for some arbitrary real p > 1, 

I -/^r""^!" -^0 as A: ^ CO. 



where the polynomials iTk{x) and Tr{x) are of degree < m, and c is a constant to be chosen 
appropriately. 

(5) Relative to test functions (p if f(p and f^ip are integrable over J and 

j^ifk-f)(p — ^0 for every G C^(J) as A; — > oo, 

where C^{J) is the class of infinitely differentiable continuous functions that vanish throughout 
some neighbourhood of each of the end points of J. For an unbounded J, a function is said to 
vanish in some neighbourhood of +oo if it vanishes on some ray (r, oo). 

While pointwise convergence does not imply any other type of convergence, uniform conver- 
gence on a bounded interval implies all the other convergences. 

It is to be observed that apart from pointwise and uniform convergences, all the other modes 
listed above represent some sort of an averaged contribution of the entire interval J and are 
therefore not of much use when pointwise behaviour of the limit / is necessary. Thus while 
limits in the mean are not unique, oscillating functions are tamed by m-integral convergence for 
adequately large values of m, and convergence relative to test functions, as we see below, can 
be essentially reduced to m-integral convergence. On the contrary, our graphical convergence 

— which may be considered as a pointwise biconvergence with respect to both the direct and 
inverse images of / just as usual pointwise convergence is with respect to its direct image only 

— allows a sequence (in fact, a net) of functions to converge to an arbitrary relation, unhindered 
by external influences such as the effects of integrations and test functions. To see how this can 
indeed matter, consider the following 

Example 1.2. Let fkix) = sinkx, k = 1,2, ■ ■ ■ and let J be any bounded interval of the real 
line. Then 1-integrally we have 



fi ^\x) = —— coskx = —— + [ sinkxidxi, 
k k Jq 



which obviously converges to uniformly (and therefore in the mean) as /c — > cx). And herein 
lies the point: even though we cannot conclude about the exact nature of sinkx as k increases 
indefinitely (except that its oscillations become more and more pronounced), we may very 
definitely state that liuik^ooicos kx) / k = uniformly. Hence from 



fl ^\x) — > = -|- / lim s'mkxidxi 
Jo k^oo 



it follows that 



1-integrally. 



lim sin kx = Q (3) 
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Continuing with the same sequence of functions, we now examine its test-functional conver- 
gence with respect to 93 G Co(— 00, 00) that vanishes for all x ^ (a,/?). Integrating by parts, 

fw=l ^{xi) smkxidxi 



1 1 /"-^ 

— [93(2:1) cos fcxi]^ — — / if' {xi) coskxidxi 

k k 



The first integrated term is due to the conditions on ip while the second also vanishes because 
V? G Co(— 00, 00). Hence 



/oo />/3 
fk^ — >0= / lim ip{xi) sinksdxi 
-00 Ja 



for all (f, and leading to the conclusion that 

lim sin kx = (4) 

test-functionally. ■ 

This example illustrates the fact that if Supp((/3) = [a, 13] C j'^, integrating by parts suffi- 
ciently large number of times so as to wipe out the pathological behaviour of (fk) gives 

r0 



[ fk^= [ 

J J Ja 



fkf 

/rV' = --- = (-ir / 

i J a 

where fl "^\x) = TTk{x) + dxi f^^ dx2 • • • /^^"""^ dxmfk{xm) is an m-times arbitrary indefinite 
integral of fk- If now it is true that fj^ fj, ™^ /a /^"'"^ then it must also be true that 
fk '"V*'"*^ converges in the mean to (^("*) so that 

J a J a J a J a 

In fact the converse also holds leading to the following E quivalences betw een m-convergence in 
the mean and convergence with respect to test-functions . iKorevaar ( 19681 ) . 

Type 1 Equivalence. If / and [fk) are functions on J that are integrable on every interior 
subinterval, then the following are equivalent statements. 

(a) For every interior subinterval / of J there is an integer m/ > 0, and hence a smallest 
integer m > 0, such that certain indefinite integrals '"^ of the functions fk converge in the 
mean on / to an indefinite integral f^~"^^; thus J^- ™-) _ j(-m)| _^ q_ 

(b) Jjifk -/)</' ^ for every ^ G C^{J). 

A significant generalization of this Equivalence is obtained by dropping the restriction that 
the limit object / be a function. The need for this generalization arises because metric function 
spaces are known not to be complete: Consider the sequence of functions (Fig. Ofa)) 

if a < X < 
f^{x) = { kx if < X < l//c (5) 

1 ifl/k<x<b 



^By definition, the support (or supporting interval) of (p{x) G C^[a,P] is [a,f5] if ip and all its 
derivatives vanish for x < a and x > p. 



9 




Figure 3: Incompleteness of function spaces. Figure (a) demonstrates the classic example of non-completeness 
of the space of real- valued continuous functions leading to the complete spaces L„ [a, b] whose elements are 
equivalence classes of functions with / ~ g iff the Lebesgue integral \ f — g\" — 0. Figures (b) and (c) illustrate 
distributional convergence of the functions fk{x) of Eq. Q to the Dirac delta 5{x) leading to the complete space 
of generalized functions. In comparison, note that the space of continuous functions in the uniform metric C[a, b] 
is complete which suggests the importance of topologies in determining convergence properties of spaces. 



which is not Cauchy in the uniform metric p{fj, fk) = sup^^^^i, Ifji^) — fk{x)\ but is Cauchy in 

the mean p{fj, fk) = J^^ \ fjix) — fk{x)\dx, or even pointwise. However in either case, (fk) cannot 
converge in the respective metrics to a continuous function and the hmit is a discontinuous unit 
step function 

'0 if o < X < 
1 if < X < 6 



G(x) 



with graph ([a, 0], 0) |J((0, 6], 1), which is also integrable on [a,b]. Thus even if the hmit of 
the sequence of continuous functions is not continuous, both the hmit and the members of the 
sequence are integrable functions. This Riemann integration is not sufficiently general, however, 
and this type of integrability needs to be replaced by a much weaker condition resulting in the 
larger class of the Lebesgue integrable complete space of functions L[a, 6].® 



*Both Riemann and Lebesgue integrals can be formulated in terms of the so-called step functions s{x), 
which are piecewise constant functions with values {ai)j^iOn a finite number of bounded subintervals 
{Ji)i=i (which may reduce to a point or may not contain one or both of the end-points) of a bounded or 

unbounded interval J, with integral fj s{x)dx =^X]{=i '^il'^' l- While the Riemann integral of a bounded 
function f{x) on a bounded interval J is defined with respect to sequences of step functions {sj)'f^i 
and {tj)J^i satisfying Sj{x) < f{x) < tj{x) on J with fj{sj — tj) ^ as j ^ oo as R fjf{x)dx = 
lim fjSj{x)dx — lim f jtj{x)dx, the less restrictive Lebesgue integral is defined for arbitrary functions / 
over bounded or unbounded intervals J in terms of Cauchy sequences of step functions fj \si — Sfe| ^ 0, 
i,k c», converging to f(x) as 

Sj(x) — *■ f{x) pointwise almost everywhere on J, 

to be 

/ f{x)dx =^ lim / Sj{x)dx. 
J J i^^Jj 

That the Lebesgue integral is more general (and therefore is the proper candidate for completion of 
function spaces) is illustrated by the example of the function defined over [0, 1] to be on the rationals 
and 1 on the irrationals for which an application of the definitions verify that whereas the Riemann 
integral is undefined, the Lebesgue integral exists and has value 1. The Riemann integral of a bounded 
function over a bounded interval exists and is equal to its Lebesgue integral. Because it involves a larger 
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The functions in FigEfbl), 



k if < a; < 1/A; 
xe[a,b]- {0,1/k), 



Sk{x) = 

can be associated with the arbitrary indefinite integrals 



{0 a < X < 

kx 0<x<l/k 
1 l/k<x<b 

of Fig. |3Ib2), which, as noted above, converge in the mean to the unit step function Q(x); hence 

f^^Sky^ = fl^Sk^p = — fa ~^ ~ Jo ^'{x)dx = '/'(O)- But there can be no functional 

relation 6{x) for which 6{x)ip{x)dx = 97(0) for all ip £ CQ[a,/3], so that unhke in the case 

in Type 1 Equivalence, the limit in the mean 0(x) of the indefinite integrals 5^ ix) cannot 
he expressed as the indefinite integral 6^^^\x) of some function 6{x) on any interval containing 
the origin. This leads to the second more general type of equivalence 

Type 2 Equivalence. If (/fc) are functions on J that are integrable on every interior subin- 
terval, then the following are equivalent statements. 

(a) For every interior subinterval I of J there is an integer mj > 0, and hence a smallest 
integer m > 0, such that certain indefinite integrals of the functions fk converge in the 
mean on / to an integrable function Q which, unlike in Type 1 Equivalence, need not itself be 
an indefinite integral of some function /. 

(b) Ck{ip) = jj fk^ c{ip) for every £ C^{J). 

Since we are now given that Jj f^ "^\x)dx — > Jj'^{x)dx, it must also be true that ^'^ipi"^) 
converges in the mean to ^ip^"^^ whence 



(-'")(^M 



The natural question that arises at this stage is then: What is the nature of the relation (not 
function any more) ^{x)7 For this it is now stipulated, despite the non-equality in the equation 
above, that as in the mean m-integral convergence of {fk) to a function f, 

@{x) := lim 5i~^\x)= T 6{x')dx' (6) 

defines the non- functional relation ("generalized function") 5{x) integrally as a solution of the 
integral equation ^ of the first kind; hence formally^ 

= d^ 

End Tutorial2 



The above tells us that the "delta function" is not a function but its indefinite integral is 
the piecewise continuous function G obtained as the mean (or pointwise) limit of a sequence 
of non-differentiable functions with the integral of dQk{x)/dx being preserved for all k S 

family of functions, all integrals in integral convergences are to be understood in the Lebcsguc sense. 

^The observant reader cannot have failed to notice how mathematical ingenuity successfully transferred 
the "troubles" of {Sk)kLi to the sufficiently differentiable benevolent receptor ip so as to be able to work 

backward, via the resultant trouble free ((5^ ™'')fc°=ij to the final object S. This necessarily hides the true 
character of i5 to allow only a view of its integral manifestation on functions. This unfortunately is not 
general enough in the strongly nonlinear physical situations responsible for chaos, and is the main reason 
for constructing the multifunctional extension of function spaces that we use. 
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What then is the delta (and not its integral)? The answer to this question is contained in 
our multifunctional extension Multi(X, Y) of the function space Map{X, Y) considered in Sec. 
3. Our treatment of ill-posed problems is used to obtain an understanding and interpretation 
of the nurnerica l resul ts of the discretized spectral approximation in neutron transport theory 
SenguDtal (|l988l . Il995h . The main conclusions are the following: In a one-dimensional discrete 



system that is governed by the iterates of a nonlinear map, the dynamics is chaotic if and 
only if the system evolves to a state of maximal ill-posedness. The analysis is based on the 
non-injectivity, and hence ill-posedness, of the map; this may be viewed as a mathematical 
formulation of the stretch- and- fold and stretch- cut- and-paste kneading operations of the dough 
that are well-established artifacts in the theory of chaos and the concept of maximal ill-posedness 
helps in obtaining a physical understanding of the nature of chaos. We do this through the 
fundamental concept of t he graphical convergence of a sequence (generally a net) of functions 
Sengupta and Eav that is allowed to converge graphically, when the conditions are right. 



to a set-valued map or multifunction. Since ill-p osed problems n aturally lead to multifunctional 
inverses through functional generalized inverses ISenguntal (ll997l V it is natural to seek solutions 
of ill-posed problems in multifunctional space Multi(X, Y) rather than in spaces of functions 
Map(X, y); here Multi(X, K) is an extension of Ma,p(X,Y) that is generally larger than the 
smallest dense extension Multi|(X, y). 

Feedback and iteration are natural processes by which nature evolves itself. Thus almost 
every process of evolution is a self-correction process by which the system proceeds from the 
present to the future through a controlled mechanism of input and evaluation of the past. 
Evolution laws are inherently nonlinear and complex; here complexity is to be understood as 
the natural manifestation of the nonlinear laws that govern the evolution of t he system . 

This work presents a mathematical description of complexity based on Senguuta (Il997l) 



and iSengupta and Rav (2000i) and is organized as follows. In Sec. 1, we follow iSengupta 
to give an overview of ill-posed problems and their solution that forms the foundation 
of our approach. Sees. 2 to 4 apply these ideas by defining a chaotic dynamical system as a 
maximally ill-posed problem; by doi ng this w e are a ble to overcome the limitations of the three 
Devaney characterizations of chaos Devanev (|l989l ^ that apply to the specific case of iteration 
of transformations in a metric space, and the resulting graphical convergence of functions to 
multifunctions is the basic tool of our approach. Sec. 5 analyzes graphical convergence in 
Multi(X) for the discretized spectral approximation of neutron transport theory, which suggests 
a natural link between ill-posed problems and spectral theory of non- linear operators. This seems 
to off'er an answer to the question of why a natural system should increase its complexity, and 
eventually tend toward chaoticity, by becoming increasingly nonlinear. 



2. Ill-Posed Problem and its solution 

This section based on ISenguDtal (|l997l ^ presents a formulation and solution of ill-posed problems 
arising out of the non-injectivity of a function f : X ^ Y between topological spaces X and 
Y. A workable knowledge of this approach is necessary as our theory of chaos leading to the 
characterization of chaotic systems as being a maximally ill-posed state of a dynamical system is 
a direct application of these ideas and can be taken to constitute a mathematical representation 
of the familiar stretch- cut- and paste and stretch- and- fold paradigms of chaos. The problem of 
finding an x X for a given y £Y from the functional relation f{x) = y is an inverse problem 
that is ill-posed (or, the equation f{x) = y is ill-posed) if any one or more of the following 
conditions are satisfied. 

(IPl) / is not injective. This non-uniqueness problem of the solution for a given y is the 
single most significant criterion of ill-posedness used in this work. 

(IP2) / is not surjective. For a y £Y, this is the existence problem of the given equation. 

(IPS) When / is bijective, the inverse f~^ is not continuous, which means that small changes 
in y may lead to large changes in x. 
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A problem f{x) = y for which a solution exists, is unique, and small changes in data y 
lead to only small changes in the solution x is said to be well-posed or properly posed. This 
means that f{x) = y is well-posed if / is bijective and the inverse : Y ^ X is continuous; 
otherwise the equation is ill-posed or improperly posed. It is to be noted that the three criteria 
are not, in general, independent of each other. Thus if / represents a bijective, bounded linear 
operator between Banach spaces X and Y, then the inverse mapping theorem guarantees that 
the inverse is continuous. Hence ill-posedness depends not only on the algebraic structures 
of X, Y, f but also on the topologies of X and Y. 

Example 2.1. As a non-trivial example of an inverse problem, consider the heat equation 

de{x,t) _ 2^dix^ 
dt dx"^ 

for the temperature distribution 9{x, t) of a one-dimensional homogeneous rod of length L 
satisfying the initial condition 0(x,O) = 9q{x)., < x < L, and boundary conditions 6{0,t) = 
= 9{L,t), < t < T, having the Fourier sine-series solution 



V L 

n=l 



e{x, t) = Y,An sin (^x) e-^"* (8) 

where A„ = (c7r/a)n and 



An = ^ j Oo{x') sin {-^x'^ dx' 



are the Fourier expansion coefficients. While the direct problem evaluates 9{x,t) from the 
differential equation and initial temperature distribution 9q{x), the inverse problem calculates 
6q{x) from the integral equation 

2 

0^[x) = - k{x,x')eo{x')dx', 0<x<L, 
^ Jo 

when this final temperature 9t is known, and 

oo 

r f i\ sr^ ■ (n-n \ . /nir .\ ^x'^t 
k[x, x ) = sm y~JJ^j \ JJ'^ ) ^ " 

n=l 

is the kernel of the integral equation. In terms of the final temperature the distribution becomes 

oo 

eT{x)=Y,B.^sin{^x)e-'l('-^) (9) 

with Fourier coefficients 



L 

n=l 



In L^[0, a], Eqs. and © at t = T and t = yield respectively 



oo 

= ^E^ne-^'"^<e-^^?^||eof (10) 

n.=l 

L °° 



= f E^n-''"""- (11) 

n=l 

The last two equations differ from each other in the significant respect that whereas Eq. (|1()|) 
shows that the direct problem is well-posed according to (IP3), Eq. (jllj) means that in the 
absence of similar bounds the inverse problem is ill-posed. ^'^ ■ 



^Recall that for a linear operator continuity and boundedness are equivalent concepts. 
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Example 2.2. Consider the Volterra integral equation of the first kind 

y{x) = I r{x')dx' = Kr 

J a 

where y,r & C[a,b] and K : C[0, 1] — > C[0, 1] is the corresponding integral operator. Since 
the differential operator D = d/dx under the sup-norm ||r|| = supo<x.<i \r{x)\ is unbounded, 
the inverse problem r = Dy for a differentiable function y on [a, b] is ill-posed, see Example 
6.1. However, y = Kr becomes well-posed if y is considered to be in C^[0, 1] with norm ||y|| = 
supo<^<i \Dy\- This illustrates the importance of the topologies of X and Y in determining the 
ill-posed nature of the problem when this is due to (IP3). ■ 

Ill-posed problems in nonlinear mathematics of type (IPl) arising from the non-injectivity of 
/ can be considered to be a generalization of non- uniqueness of solutions of linear equations as, 
for example, in eigenvalue problems or in the solution of a system of linear algebraic equations 
with a larger number of unknowns than the number of equations. In both cases, for a given 
y the solution set of the equation f{x) = y is given by 

/-(y) = N/ = WgX: fix') = fix) = y}. 

A significant point of difference between linear and nonlinear problems is that unlike the special 
importance of in linear mathematics, there are no preferred elements in nonlinear problems; 
this leads to a shift of emphasis from the null space of linear problems to equivalence classes for 
nonlinear equations. To motivate the role of equivalence classes, let us consider the null spaces 
in the following linear problems. 

(a) Let / : ^ M be defined by fix, y) = x+y, (rr, y) £ M^. The null space of / is generated 
by the equation y = —x on the x-y plane, and the graph of / is the plane passing through the 
lines p = X and p = y. For each p E R the equivalence classes /~(p) = {(x, ?/) G : x -|- y = p} 
are lines on the graph parallel to the null set. 

(b) For a linear operator A: M" — > M"^, m < n, satisfying (1) and (2), the problem Ax = y 
reduces A to echelon form with rank r less than min{m,n}, when the given equations are 
consistent. The solution however, produces a generalized inverse leading to a set- valued inverse 
A~ of A for which the inverse images of y G ^(^) are multivalued because of the non-trivial 
null space of A introduced by assumption (1). Specifically, a null-space of dimension n — r 
is generated by the free variables {xj}'j^^^i which are arbitrary: this is illposedness of type 
(1). In addition, m — r rows of the row reduced echelon form of A have all entries that 
introduces restrictions on m — r coordinates {yi}'i^r+i V which are now related to {yiYi=i- 
this illustrates illposedness of type (2). Inverse ill-posed problems therefore generate multivalued 
solutions through a generalized inverse of the mapping. 

(c) The eigenvalue problem 

(^ + A2)y = y(0) = = y(l) 

has the following equivalence class of 

[0]b2 = {sin(7rmx)}^=o> = {d^/dx^ + A^) , 

as its eigenfunctions corresponding to the eigenvalues = vrm. 

Ill-posed problems are primarily of interest to us explicitly as noninjective maps /, that is 
under the condition of (IPl). The two other conditions (IP2) and (IP3) are not as significant 
and play only an implicit role in the theory. In its application to iterative systems, the degree 
of non-injectivity of / defined as the number of its injective branches, increases with iteration 
of the map. A necessary (but not sufficient) condition for chaos to occur is the increasing 
non-injectivity of / that is expressed descriptively in the chaos literature as stretch- and- fold or 
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stretch- cut- and-paste operations. This increasing noninjectivity that we discuss in the following 
sections, is what causes a dynamical system to tend toward ch aoticity. Ill-posedness arising 
from non-surjectivity of (injective) / in the form of regularization Tikhonov and ArseninI ( 19771 ) 
has received wide attention in the literature of ill-posed problems; this however is not of much 
significance in our work. 



Begin Tutorials: Generalized Inverse 

In this Tutorial, we take a quick look at the equation a{x) = y, where a: X ^ Y is a linear 
map that need not be either one-one or onto. Specifically, we will take X and Y to be the 
Euclidean spaces and M"^ so that a has a matrix representation A € M*"^" where R™^" 
is the collection of m x n matrices with real entries. The inverse exists and is unique 
iff m = n and rank(j4) = n; this is the situation depicted in Fig. Q^a). If A is neither one- 
one or onto, then we need to consider the multifunction A~ , a functional choice of which is 
known as the general i zed in verse G oi A. A good introductory text for generalized inverses is 
Campbell and Maver (jl979h FigureEr a) introduces the following definition of the Moore-Penrose 



generalized inverse Gmp- 

Definition 2.1. Moore-Penrose Inverse. If a: M" is a linear transformation with 

matrix representation A £ i^^x" then the Moore-Penrose inverse Gmp G of A (we will 

use the same notation Gmp '■ I^"* IK" for the inverse of the map a) is the noninjective map 
defined in terms of the row and column spaces of A, row(^) = TZlA"^), col(^) = Tl{A), as 

r 4^f / («lrow(A))~Hy)> ifyGcol(A) , . 




Figure 4: (a) Moore-Penrose generalized inverse. The decomposition of X and Y into the four fundamental 
subspaces of A comprising the null space M{A)^ the column (or range) space 7?.(y4), the row space 7^(A'^) and 
A/'(j4''"), the complement of Ti-iA) in Y , is a basic result in the theory of linear equations. The Moore-Penrose 
inverse takes advantage of the geometric orthogonality of the row space 'R,(^A^^ and Mi^A) in R" and that of the 
column space and A/'(A^) in R™. (b) When X and Y are not inner-product spaces, a non-injective inverse can be 
defined by extending ftoY — 7i{f) suitably as shown by the dashed curve, where g{x) :— ri + ((r2 — ri) /ri)f{x) 
for all X £ ©(/) was taken to be a good definition of an extension that replicates / in y — TZ{f); here xi ~ X2 
under both / and g, and yi ~ 1/2 under {/, g} just as b is equivalent to by in the Moore-Penrose case. Note that 
both {/,(?} and {/ ,p } are both multifunctions on X and Y respectively. Our inverse G, introduced later in 
this section, is however injective with G{Y — TZ{f)) := 0. 
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Note that the restriction a I 



row(j4) 



of a to Tl{A^) is bijective so that the inverse (a| 



vow{A)j 



is well-defined. The role of the transpose matrix appears naturally, and the Gmp of Eq. (|12j) is 
the unique matrix that satisfies the conditions 



AGupA = A, Gmp^Gmp = Gmp, 
(Gmp^)^ = GmpA, {AGmpV = AGup 



(13) 



that follow immediately from the definition ()12() : hence Gmp^ and AGmp are orthogonal pro- 
jections^^ onto the subspaces Tl{A"^) = TI{Gmp) and Tl{A) respectively. Recall that the range 
space TZ{A'^) of A^^ is the same as the row space iow{A) of A, and Ti-iA) is also known as the 
column space of A, col(A). 



Example 2.3. For a: 



let 



A 



( 1 
3 
2 

V2 



-3 2 1 2 \ 

-9 10 2 9 

-6 4 2 4 

-6 8 1 7 y 



By reducing the augmented matrix {A\y) to the row-reduced echelon form, it can be verified that 
the null and range spaces of A are 3- and 2-dimensional respectively. A basis for the null space 
of and of the row and column space of A obtained from the echelon form are respectively 



2\ 

1 




V 1/ 



1 \ 
1 



1 



and 



/ 1 \ 

-3 


3/2 
V 1/2 / 



V 



\ 


1 

-1/4 
3/4 / 



1 \ 


2 
1 



1 


VI/ 



According to its definition Eq. (|12|) , the Moore-Penrose inverse maps the middle two of the above 
set to (0, 0, 0, 0, 0)"^, and the A -image of the first two (which are respectively (19, 70, 38, hVj^ and 
(70, 275, 140, 205)^ lying, as they must, in the span of the last two), to the span of (1, —3, 2, 1, 2)'^ 
and (3, —9, 10, 2, 9)^ because a restricted to this subspace of is bijective. Hence 



Gmp 



/ 


( ^ ] 




( ° ^ 




-3 







A 





A 


1 




3/2 




-1/4 


\ 


V 1/2 ) 




I 3/4/ 





/ 1 








\ 




-3 
















1 










3/2 


-1/4 








/ 


V 1/2 


3/4 





/ 



The second matrix on the left is invertible as its rank is 4. This gives 



Gmp 



9/275 -1/275 18/275 -2/55 \ 

-27/275 3/275 -54/275 6/55 

-10/143 6/143 -20/143 16/143 

238/3575 -57/3575 476/3575 -59/715 

V -129/3575 106/3575 -258/3575 47/715 / 



(14) 



as the Moore-Penrose inverse of A that readily verifies all the four conditions of Eqs. (|13|) . 
The basic point here is that, as in the case of a bijective map, Gmp^ and AGmp are identities 
on the row and column spaces of A that define its rank. For later use — when we return 

^^A real matrix A is an orthogonal projector iS = A and A = A'^ . 
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to this example for a simpler inverse G — given below are the orthonormal bases of the four 
fundamental subspaces with respect to which Gmp is a representation of the generalized inverse 
of A] these calculations were done by MATLAB. The basis for 

(a) the column space of A consists of the first 2 columns of the eigenvectors of AA^ : 

(-1633/2585,-363/892, 3317/6387, 363/892)'^ 
(-929/1435, 709/1319, 346/6299,-709/1319)"^ 

(b) the null space of consists of the last 2 columns of the eigenvectors of AA^ : 

(-3185/8306, 293/2493,-3185/4153, 1777/3547)^ 
(323/1732, 533/731, 323/866, 1037/1911)^ 

(c) the row space of A consists of the first 2 columns of the eigenvectors of Al^ A: 

(421/13823, 44/14895,-569/918,-659/2526, 1036/1401) 
(661/690, 412/1775, 59/2960,-1523/10221,-303/3974) 

(d) the null space of A consists of the last 3 columns of the of A^ A: 

(-571/15469, -369/776, 149/25344, -291/350, -389/1365) 
(-281/1313, 956/1489, 875/1706,-1279/2847, 409/1473) 
(292/1579,-876/1579, 203/342, 621/4814, 1157/2152) 

The matrices Qi and Q2 with these eigenvectors (xj) satisfying ||xj|| = 1 and {xi,Xj) = for 
i 7^ j as their columns are orthogonal matrices with the simple inverse criterion = Q^. ■ 

End Tutorials 

The basic issue in the solution of the inverse ill-posed problem is its reduction to an well-posed 
one when restricted to suitable subspaces of the domain and range of A. Considerations of 
geometry leading to their decomposition into orthogonal subspaces is only an additional feature 
that is not central to the problem: recall from Eq. that any function / must necessarily 
satisfy the more general set-theoretic relations ff~f = f and f~ ff~ = f~ of Eq. (|13|) for the 
multiinverse /~ of / : X — > y. The second distinguishing feature of the MP-inverse is that it 
is defined, by a suitable extension, on all of Y and not just on f{X) which is perhaps more 
natural. The availability of orthogonality in inner-product spaces allows this extension to be 
made in an almost normal fashion. As we shall see below the additional geometric restriction 
of Eq. (|13() is not essential to the solution process, and infact, only results in a less canonical 
form of the inverse. 

Begin Tutorial^: Topological Spaces 

This Tutorial is meant to familiarize the reader with the basic principles of a topological space. 
A topological space {X^U) is a set X with a class^^ U of distinguished subsets, called open sets 
of X , that satisfy 

(Tl) The empty set and the whole X belong to U 

(T2) Finite intersections of members of li belong to lA 

(T3) Arbitrary unions of members of U belong to U. 

Example 2.4. (1) The smallest topology possible on a set X is its indiscrete topology when 
the only open sets are and X\ the largest is the discrete topology where every subset of X is 
open (and hence also closed). 

^^In this sense, a class is a set of sets. 
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(2) In a metric space {X,d), let Bi;{x,d) = {y £ X : d{x,y) < e} be an open ball at x. Any 
subset U oi X such that for each x £ U there is a d-ball B^(x,d) C [/ in [/, is said to be an 
open set of {X, d) . The collection of all these sets is the topology induced by d. The topological 
space {X,hl) is then said to be associated with (induced by) {X,d). 

(3) If ~ is an equivalence relation on a set X, the set of all saturated sets [x]^ = {y £ X : 
y ~ x} is a topology on X; this topology is called the topology of saturated sets. 

We argue in Sec. 4.2 that this constitutes the defining topology of a chaotic system. 

(4) For any subset A of the set X, the A-inclusion topology on X consists of and every 
superset of A, while the A- exclusion topology on X consists of all subsets of X — A. Thus A is 
open in the inclusion topology and closed in the exclusion, and in general every open set of one 
is closed in the other. 

The special cases of the a-inclusion and a-exclusion topologies for A = {a} are defined in a 
similar fashion. 

(5) The cofinite and cocountable topologies in which the open sets of an infinite (resp. un- 
countable) set X are respectively the complements of finite and countable subsets, are examples 
of topologies with some unusual properties that are covered in Appendix Al. If X is itself 
finite (respectively, countable), then its cofinite (respectively, cocountable) topology is the dis- 
crete topology consisting of all its subsets. It is therefore useful to adopt the convention, 
unless stated to the contrary, that cofinite and cocountable spaces are respectively infinite and 
uncountable. ■ 

In the space {X,U), a neighbourhood of a point x £ X is a nonempty subset of X that 
contains an open set U containing x; thus A^ C X is a neighbourhood of x iff 

x£U CN (15) 

for some U £ U. The largest open set that can be used here is Int(A^) (where, by definition, 
Int(74) is the largest open set that is contained in A) so that the above neighbourhood criterion 
for a subset N of X can be expressed in the equivalent form 

N Q X is aU — neighbourhood of x iff x G Inti^(A^) (16) 

implying that a subset of {X,U) is a neighbourhood of all its interior points, so that A^ £ Nx =^ 
A^ £ My for all y £ Int(A^). The collection of all neighbourhoods of x 

Afx=^ {N C X: x £U C N for some U £U} (17) 

is the neighbourhood system at x, and the subcollection U of the topology used in this equation 
constitutes a neighbourhood (local) base or basic neighbourhood system, at x, see Def. Al.l of 
Appendix Al. The properties 

(Nl) X belongs to every member A^ of Mx, 

(N2) The intersection of any two neighbourhoods of x is another neighbourhood of x: N,M £ 
Afx^ NClM £Mx, 

(N3) Every superset of any neighbourhood of x is a neighbourhood of x: (Af £ J\fx) A (M C 
N)^N £ Mx. 

that characterize Nx completely are a direct consequence of the definition ()15|). p6|) that may 
also be stated as 

(NO) Any neighbourhood A^ £ Nx contains another neighbourhood U of x that is a neigh- 
bourhood of each of its points: {{MN £ Nx)i3U £ Nx){U <^ N)): (iy £ U ^ U £ Ny). 

Property (NO) infact serves as the defining characteristic of an open set, and U can be 
identified with the largest open set Int(A^) contained in A^; hence a set G in a topological space 
is open iff it is a neighbourhood of each of its points. Accordingly if Nx is a given class of 
subsets of X associated with each x £ X satisfying (Nl) — (N3), then (NO) defines the special 
class of neighbourhoods G 

U = {G £ Nx - X £ B <^ G foi all X £ G and a basic nbd B £ Nx} (18) 
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as the unique topology on X that contains a basic neighbourhood of each of its points, for 
which the neighbourhood system at x coincides exactly with the assigned collection Nx] compare 
Def Al.l. Neighbourhoods in topological spaces are a generalization of the familiar notion of 
distances of metric spaces that quantifies "closeness" of points of X. 

A neighbourhood of a nonempty subset A oi X that will be needed later on is defined in 
a similar manner: is a neighbourhood of ^4 iff j4 C Int(A), that is j4 C [/ C A; thus the 
neighbourhood system at A is given by Ma = PlaeA-^" C A : G G Ma for every a ^ A} 

is the class of common neighbourhoods of each point of A. 

Some examples of neighbourhood systems at a point x in A are the following: 

(1) In an indiscrete space {X,L{), X is the only neighbourhood of every point of the space; 
in a discrete space any set containing x is a neighbourhood of the point. 

(2) In an infinite cofinite (or uncountable cocountable) space, every neighbourhood of a 
point is an open neighbourhood of that point. 

(3) In the topology of saturated sets under the equivalence relation ~, the neighbourhood 
system at x consists of all supersets of the equivalence class [x]^. 

(4) Let X £ X. In the x-inclusion topology, consists of all the non-empty open sets of A 
which are the supersets of {x}. For a point y ^ x of X, My are the supersets of {x, y}. 

For any given class of subsets of A, a unique topology U^tS) can always be constructed 
on A by taking all finite intersections t^a of members of S followed by arbitrary unions t^av of 
these finite intersections. 1{{tS) := t^av is the smallest topology on A that contains and is 
said to be generated by t5. For a given topology U on X satisfying = U{t:S), fS is a subbasis, 
and T^A := t-B a basis, for the topology U; for more on topological basis, see Appendix Al. The 
topology generated by a subbase essentially builds not from the collection itself but from the 
finite intersections t^a of its subsets; in comparison the base generates a topology directly from 
a collection t^S of subsets by forming their unions. Thus whereas any class of subsets can be used 
as a subbasis, a given collection must meet certain qualifications to pass the test of a base for a 
topology: these and related topics are covered in Appendix Al. Different subbases, therefore, 
can be used to generate different topologies on the same set A as the following examples for 
the case of A = M demonstrates; here (a, b), [a, b), (a, b] and [a, 6], for a < 6 G M, are the usual 
open-closed intervals in M^^. The subbases t^i = {(a,oo), (—00,6)}, ^•52 = {[a, 00), (—00,6)}, 
T^s = {(a, 00), (—00,6]} and ^S^ = {[a, 00), (—00,6]} give the respective bases t-Si = {(a, 6)}, 
= {[(^ift)}, tBs = {{a,b]} and = {[a,b]}, a < b £ W, leading to the standard (usual), 
lower limit (Sorgenfrey), upper limit, and discrete (take a = b) topologies on M. Bases of the 
type (a, 00) and (—00,6) provide the right and left ray topologies on M. 

This feasibility of generating different topologies on a set can be of great practical significance 
because open sets determine convergence characteristics of nets and continuity characteristics of 
functions, thereby making it possible for nature to play around with the structure of its working 
space in its kitchen to its best possible advantage}'^ 

Here are a few essential concepts and terminology for topological spaces. 

Definition 2.2. Boundary, Closure, Interior. The boundary of A in X is the set of points 
X £ X such that every neighbourhood N of x intersects both A and X — A: 

Bdy(A) ''^'{x G A: (VA G AA,)((An A / 0) A (AfKA - A) ^ 0))} (19) 
^^By definition, an interval / in a totally ordered set X is a subset of X with the property 

{xi,x2 e I) A e X : xi ^ X3 ^ X2) =^ X3 e I 

so that any element of X lying between two elements of / also belongs to /. 

^''Although we do not pursue this point of view here, it is nonetheless tempting to speculate that the 
answer to the question "Why does the entropy of an isolated system increase?" may be found by exploiting 
this line of reasoning that seeks to explain the increase in terms of a visible component associated with 
the usual topology as against a different latent workplace topology that governs the dynamics of nature. 
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where Mx is the neighbourhood system of Eq. i|I7)) at x. 

The closure of A is the set of all points x £ X such that each neighbourhood of x contains 
at least one point of A that may be x itself. Thus the set 

Cl{A) = {x G X : (ViV G Af^) {Nf]A^ 0)} (20) 

of all points in X adherent to A is given by is the union of A with its boundary. 
The interior of A 

lnt{A) = {xeX:{3Ne M^) {N C A)} (21) 

consisting of those points of X that are in A but not in its boundary, Int(A) = A — Bdy(A), 
is the largest open subset of X that is contained in A. Hence it follows that Int(Bdy(A)) = 0, 
the boundary of A is the intersection of the closures of A and X — A, and a subset N of X is a 
neighbourhood of x iff x £ Int(A^). □ 

The three subsets Int(yl), Bdy(A) and exterior of A defined as Ext(^) := Int(X — A) = 
X — Cl{A), are pairwise disjoint and have the full space X as their union. 

Definition 2.3. Derived and Isolated sets. Let A be a subset of X. A point x £ X (which 
may or may not be a point of A) is a cluster point of A if every neighbourhood N S Mx contains 
atleast one point of A different from x. The derived set of A 

Der(A) = {x E X: (ViV G A4) {Nf]{A - {x}) / 0)} (22) 
is the set of all cluster points of A. The complement of Dei (A) in A 

Iso(A) =^ A - Der(A) = Cl(^) - Der(^) (23) 

are the isolated points of A to which no proper sequence in A converges, that is there exists a 
neighbourhood of any such point that contains no other point of A so that the only sequence 
that converges to a £ Iso(^) is the constant sequence {a,a,a,- ■ ■). □ 

Clearly, 

C\{A) = A[j Der(^) = ^ U Bdy(^) 

= Iso(A) U Der(A) = Int(^) U Bdy(A) 

with the last two being disjoint unions, and A is closed iff A contains all its cluster points, 
Der(^) C A, iff A contains its closure. Hence 

A = C\{A) ^ C\{A) = {x£A: {{3N £ A4)(iV C A)) V ((ViV £ Mx)iNf]{X - A) ^ 0))} 

Comparison of Eqs. (|19j) and (|22j) also makes it clear that Bdy(j4) C Der(j4). The special case 
oi A = Iso(^) with Der(A) C X — j4 is important enough to deserve a special mention: 

Definition 2.4. Donor set. A proper, nonempty subset A of X such that lso{A) = A with 
Der(^) <^ X — A will be called self-isolated or donor. Thus sequences eventually in a donor set 
converges only in its complement; this is the opposite of the characteristic of a closed set where 
all converging sequences eventually in the set must necessarily converge in it. A closed-donor 
set with a closed neighbour has no derived or boundary sets, and will be said to be isolated in 
X. □ 

Example 2.5. In an isolated set sequences converge, if they have to, simultaneously in the 
complement (because it is donor) and in it (because it is closed). Convergent sequences in such 
a set can only be constant sequences. Physically, if we consider adherents to be contributions 
made by the dynamics of the corresponding sequences, then an isolated set is secluded from its 
neighbour in the sense that it neither receives any contributions from its surroundings, nor does 
it give away any. In this light and terminology, a closed set is a selfish set (recall that a set A 
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is closed in X iff every convergent net of X that is eventually in A converges in A; conversely a 
set is open in X iff the only nets that converge in A are eventually in it), whereas a set with a 
derived set that intersects itself and its complement may be considered to be neutral. Appendix 
A3 shows the various possibilities for the derived set and boundary of a subset A oi X. ■ 

Some useful properties of these concepts for a subset ^ of a topological space X are the 
following. 

(a) Bdy^(X) =0, 

(b) Bdy(^) = Cl(A)nCl(X-^), 

(c) lni{A) = X - CliX -A) = A- Bdy(^) = Cl(^) - Bdy(A), 

(d) Int(A)nBdy(^) = 0, 

(e) X = Int(A) U Bdy(^) U Int(A: - A), 
(f) 

Int(^) = [j{G C X : G is an open set of X contained in A} (24) 

(g) 

Cl(^) = f]{F C X : F is a closed set of X containing A} (25) 

A straightforward consequence of property (b) is that the boundary of any subset A of a 
topological space X is closed in X; this significant result may also be demonstrated as follows. 
If x G X is not in the boundary of A there is some neighbourhood of x that does not intersect 
both A and X — A. For each point y £ N, N is a neighbourhood of that point that does not 
meet A and X — A simultaneously so that A^ is contained wholly m X — Bdy(^). We may now 
take A^ to be open without any loss of generality implying thereby that X — Jidy{A) is an open 
set of X from which it follows that Bdy(^) is closed in X. 

Further material on topological spaces relevant to our work can be found in Appendix A3. 

End Tutorial4 

Worki ng in a general t opological space, we now recall the solution of an ill-posed problem 
f{x) = y Senguptal (j 19971 ) that leads to a multifunctional inverse /~ through the generalized 



inverse G. Let / : {X,U) {Y,V) be a (nonlinear) function between two topological space 
{X, U) and {Y, V) that is neither one-one or onto. Since / is not one-one, X can be partitioned 
into disjoint equivalence classes with respect to the equivalence relation xi X2 ^ fi^i) = 
/(X2). Picking a representative member from each of the classes (this is possible by the Axiom 
of Choice; see the following Tutorial) produces a basic set Xb of X; it is basic as it corresponds 
to the row space in the linear matrix example which is all that is needed for taking an inverse. 
Xb is the counterpart of the quotient set X/ of Sec. 1, with the important difference that 
whereas the points of the quotient set are the equivalence classes of X, Xb is a subset of X 
with each of the classes contributing a point to X-q- It then follows that /b : X^ — > f{X) 
is the bijective restriction a|i.ow(A) that reduces the original ill-posed problem to a well-posed 
one with Xb and f{X) corresponding respectively to the row and column spaces of A, and 
/g ^ : f{X) — > Xb is the basic inverse from which the multiinverse /~ is obtained through G, 
which in turn corresponds to the Moore-Penrose inverse Gmp- The topological considerations 
(obviously not for inner product spaces that applies to the Moore-Penrose inverse) needed to 
complete the solution are discussed below and in Appendix Al. 



Begin Tutorials: Axiom of Choice and Zorn's Lemma 

Since some of our basic arguments depend on it, this Tutorial contains a short description of the 
Axiom of Choice that has been described as "one of the most important, and at the same time 
one of the most controversial, principles of mathematics". What this axiom states is this: For 
any set X there exists a function fc- Vo{X) — > X such that fc{Aa) G Aa for every non-empty 
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subset Aa of X; here Vo{X) is the class of all subsets of X except 0. Thus, if X = {xi,X2,X3} 
is a three element set, a possible choice function is given by 

fci{xi,X2,X3}) = X3, fc{{xi,X2}) = Xi, fc{{x2,X3})=Xs, fc{{x3,Xl})=X3, 
fc{{xi}) = Xi, fc{{x2}) = X2, fc{{x3}) = X3. 

It must be appreciated that the axiom is only an existence result that asserts every set to have 
a choice function, even when nobody knows how to construct one in a specific case. Thus, for 
example, how does one pick out the isolated irrationals \/2 or vr from the uncountable reals? 
There is no doubt that they do exist, for we can construct a right angled triangle with sides of 
length 1 or a circle of radius 1. The axiom tells us that these choices are possible even though 
we do not know how exactly to do it; all that can be stated with confidence is that we can 
actually pick up rationals arbitrarily close to these irrationals. 

The axiom of choice is essentially meaningful when X is infinite as illustrated in the last two 
examples. This is so because even when X is denumerable, it would be physically impossible to 
make an infinite number of selections either all at a time or sequentially: the Axiom of Choice 
nevertheless tells us that this is possible. The real strength and utility of the Axiom however 
is when X and some or all of its subsets are uncountable as in the case of the choice of the 
single element vr from the reals. To see this more closely in the context of maps that we are 
concerned with, let / : X — > y be a non-injective, onto map. To construct a functional right 
inverse : y — > X of /, we must choose, for each y £ Y one representative element Xrop 
from the set f~{y) and define fr{y) to be that element according to / o fr{y) = f{xrcp) = y- 
If there is no preferred or natural way to make this choice, the axiom of choice allows us to 
make an arbitrary selection from the infinitely many that may be possible from f~[y). When 
a natural choice is indeed available, as for example in the case of the initial value problem 
y'{x) = x; y{0) = ao on [0, a], the definite solution ao + x'^/2 may be selected from the infinitely 
many x'dx' = < x < a that are permissible, and the axiom of choice sanctions this 

selection. In addition, each y £ Y gives rise to the family of solution sets Ay = {f~{y) : y G Y} 
and the real power of the axiom is its assertion that it is possible to make a choice fc{Ay) £ Ay 
on every Ay simultaneously; this permits the choice on every Ay of the collection to be made at 
the same time. 

Pause Tutorials 



Figure shows our Senguotal ( 19971 ^ formulation and solution of the inverse ill-posed problem 



f{x) = y. In sub-diagram X — Xb — f{X), the surjection p: X — > Xb is the counterpart of the 
quotient map Q of Fig. EJthat is known in the present context as the identification of X with Xb 
(as it identifies each saturated subset of X with its representative point in X^), with the space 
{X-Q,FT{U;p}) carrying the identification topology FT{U;p} being known as an identification 
space. By sub-diagram Y — Xb — f{X), the image f{X) of / gets the subspace topology^^ 
IT{j; V} from {Y,V) by the inclusion j : f{X) — > Y when its open sets are generated as, and 
only as, j~^{V) = Vf^f{X) for V £ V. Furthermore if the bijection /b connecting Xb and 
f{X) (which therefore acts as a 1 : 1 correspondence between their points, implying that these 
sets are set-theoretically identical except for their names) is image continuous, then by Theorem 
A2.1 of Appendix 2, so is the association q = /b o P - X ^ f{^) that associates saturated sets 
of X with elements of f{X); this makes f{X) look like an identification space of X by assigning 
to it the topology FT{Z^; q}. On the other hand if /b happens to be preimage continuous, then 

^^In a subspace A oi X, a, subset Ua of A is open iff Ua ^ Af]U for some open set U of X. The 
notion of subspace topology can be formalized with the help of the inclusion map i: A {X,U) that 
puts every point of A back to where it came from, thus 

Ua = {Ua = Af]U:U eU} 
= {i-iU):U eU}. 
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Xb acquires, by Theorem A2.2, the initial topology IT{e; V} by the embedding e : X-q — > Y 
that embeds Xb into Y through j o /g, making it look like a subspace of Y^^. In this dual 
situation, /b has the highly interesting topological property of being simultaneously image and 
preimage continuous when the open sets of Xb and f{X) — which are simply the /g ^-images of 
the open sets of f{X) which, in turn, are the /B-images of these saturated open sets — can be 
considered to have been generated by /b, and are respectively the smallest and larg est collection 
of subsets of X and Y that makes /b ini (tial-fi) naZ continuous Senguptal ( 19971 ) . A bijective 
ininal function such as /b is known as a homeomorphism and ininality for functions that are 
neither 1 : 1 nor onto is a generalization of homeomorphism for bijections; refer Eqs. ()119|) and 
(|120|) for a set-theoretic formulation of this distinction. A homeomorphism /: [X,U) — > (Y^V) 
renders the homeomorphic spaces {X, U) and (y, V) topologically indistinguishable which may 
be considered to be identical in as far as their topological properties are concerned. 




Figure 5: Solution of ill-posed problem fix) = y, f: X —>Y. G:Y^ Xb, a generalized inverse of / because 
of fGf = f and GfG = G which follows from the commutativity of the diagrams, is a functional selection of 
the multiinverse /~ : {Y,V) {X,U). </ and /< are the injective and surjective restrictions of /; these will be 
topologically denoted by their generic notations e and q respectively. 



Remark. It may be of some interest here to speculate on the significance of ininality in our 
work. Physically, a map / : {X,U) — > {Y,V) between two spaces can be taken to represent an 
interaction between them and the algebraic and topological characters of / determine the nature 
of this interaction. A simple bijection merely sets up a correspondence, that is an interaction, 
between every member of X with some member Y, whereas a continuous map establishes the 
correspondence among the special category of "open" sets. Open sets, as we see in Appendix 
Al, are the basic ingredients in the theory of convergence of sequences, nets and filters, and the 
characterization of open sets in terms of convergence, namely that a set G in X is open in it 
if every net or sequence that converges in X to a point in G is eventually in G, see Appendix 
Al, may be interpreted to mean that such sets represent groupings of elements that require 
membership of the group before permitting an element to belong it; an open set unlike its 
complement the closed or selfish set, however, does not forbid a net that has been eventually 
in it to settle down in its selfish neighbour, who nonetheless will never allow such a situation 
to develop in its own territory. An ininal map forces these well-defined and definite groups 
in {X,U) and {Y,V) to interact with each other through /; this is not possible with simple 
continuity as there may be open sets in X that are not derived from those of Y and non-open 
sets in Y whose inverse images are open in X. It is our hypothesis that the driving force behind 
the evolution of a system represented by the input-output relation f{x) = y is the attainment of 

^^A surjective function is an association iff it is image continuous and an injective function is an 
embedding iff it is preimage continuous. 
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the ininal triple state {X, f, Y) for the system. A preliminary analysis of this hypothesis is to 
be found in Sec. 4.2. 

For ininality of the interaction, it is therefore necessary to have 



FT{^/;/<} 
IT{</;V} 



IT{i;V} 
FT{l(;p}}; 



(26) 



in what follows we will refer to the injective and surjective restrictions of / by their generic 
topological symbols of embedding e and association q respectively. What are the topological 
characteristics of / in order that the requirements of Eq. (|26|) be met? From Appendix Al, it 
should be clear by superposing the two parts of Fig. I^over each other that given q: {X,U) 
{f {X), FT {U;q}) in the first of these equations, IT{j; V} will equal FT{U;q} iff j is an ininal 
open inclusion and Y receives FT{U; /}. In a similar manner, preimage continuity of e requires 
p to be open ininal and / to be preimage continuous if the second of Eq. (|2H)) is to be satisfied. 
Thus under the restrictions imposed by Eq. H26|) . the interaction / between X and Y must be 
such as to give X the smallest possible topology of /-saturated sets and Y the largest possible 
topology of images of all these sets: /, under these conditions, is an ininal transformation. 
Observe that a direct application of parts (b) of Theorems A2.1 and A2.2 to Fig. Elimplies that 
Eq. ()26() is satisfied iff /b is ininal, that is iff it is a homeomorphism. Ininality of / is simply a 
reflection of this as it is neither 1 : 1 nor onto. 

The /- and p-images of each saturated set of X are singletons in Y (these saturated sets 
in X arose, in the first place, as f~{{y}) for y £ Y) and in Xb respectively. This permits 
the embedding e = j o /b to give Xb the character of a virtual subspace of Y just as i makes 
f{X) a real subspace. Hence the inverse images p~{xr) = f~{e{xr)) with Xr G X-q, and 
Q~iy) = f~{Ky)) with y = fB{xr) G /(AT) are the same, and are just the corresponding /~ 
images via the injections e and i respectively. G, a left inverse of e, is a generalized inverse of 
/. G is a generalized inverse because the two set-theoretic defining requirements of fGf = f 
and GfG = G for the generalized inverse are satisfied, as Fig. shows, in the following forms 



In fact the commutativity embodied in these equalities is self evident from the fact that e = z/b 
is a left inverse of G, that is eG = ly. On putting back Ab into A by identifying each point of 
Ab with the set it came from yields the required set-valued inverse /~, and G may be viewed 
as a functional selection of the multiinverse /~. 

An injective branch of a function / in this work refers to the restrictions /b and its associated 
inverse /g^. 

The following example of an inverse ill-posed problem will be useful in fixing the notations 
introduced above. Let / on [0, 1] be the function of El 

Then f{x) = y is well-posed for [0,1/4), and ill-posed in [1/4,1]. There are two injective 
branches of / in {[1/4, 3/8)|J (5/8,1]}, and / is constant ill-posed in [3/8,5/8]. Hence the 
basic component /b of / can be taken to be /b(2;) = 2x for x G [0,3/8) having the inverse 
/g^(y) = x/2 with y S [0,3/4]. The generalized inverse is obtained by taking [0,3/4] as a 
subspace of [0,1], while the multiinverse /~ follows by associating with every point of the 
basic domain [0, 1]b = [0,3/8], the respective equivalent points [3/8]/ = [3/8,5/8] and [x]/ = 
{x, 7/4 — 3x} for x G [1/4, 3/8). Thus the inverses G and /~ of / are-*^^ 

^^If y ^ Ti-if) then f^{{y}) := which is true for any subset oiY—TZ{f). However from the set-theoretic 
definition of natural numbers that requires := 0, 1 = {0}, 2 = {0, 1} to be defined recursively, it follows 
that f^{y) can be identified with whenever y is not in the domain of /~. Formally, the successor 
set A+ = A\J{A} of A can be used to write := 0, 1 = 0+ = 0U{0}, 2 = 1+ = 1U{1} = {0}U{1} 
3 = 2+= 2|J{2} = {0}U{1}U{2} etc. Then the set of natural numbers N is defined to be the 
intersection of all the successor sets, where a successor set S is any set that contains and A+ whenever 
A belongs to S. Observe how in the successor notation, countable union of singleton integers recursively 
define the corresponding sum of integers. 



j7bG/ = / 



GJ/bG = G. 
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Figure 6: The function f{x) 



2x < x < 3/8 

3/4 3/8 < < 5/8 

7/6-2x/3 5/8<x<l. 



G{y) 



y/2, [0,3/4] 
0, y£ (3/4, 1] 



r{y) 



y/2, y G [0, 1/2) 

{y/2, 7/4 - 3y/2}, yG [1/2,3/4) 
[3/8,5/8], y = 3/4 

0, ye (3/4,1], 

which shows that /~ is multivalued. In order to avoid cumbersome notations, an injective 
branch of / will always refer to a representative basic branch /b, and its "inverse" will mean 
either /g or G. 

Example 2.3, Revisited. The row reduced echelon form of the augmented matrix {A\h) of 
Example 2.3 is 



/ 1 


-3 





3/2 


1/2 


bhi/2 - 62/2 


\ 








1 


-1/4 


3/4 


-361/4 + 62/4 



















-26i + 63 




V 














&1 - ^2 + &4 


/ 



The multifunctional solution x = A b, with 6 any element of Y 
image of a, is 



A~b = Gb + X2 



/3\ 
1 







+ X4 



/ -3/2 \ 


1/4 
1 

V 0/ 



+ 



(27) 

/ 

^ not necessarily in the of 



/ -1/2 \ 


-3/4 


1 / 



with its multifunctional character arising from the arbitrariness of the coefficients X2, X4, and 
X5. The generalized inverse 



G 



( 5/2 -1/2 \ 



-3/4 1/4 



V / 



is the unique matrix representation of the functional inverse a-^ : a 
defined according to^^ 

03^(6), if6G7^(a) 
0, if6Gy-7^(a), 



9{b) 



(28) 

X-Q extended to Y 
(29) 



'See footnote [T7I for a justification of the definition when b is not in 7^(a). 
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that bears comparison with the basic inverse 



/ 


5/2 


-1/2 





\ 


















-3/4 


1/4 






















V 











/ 



bi 
b2 
2bi 

\b2-b1 J 



between the 2-dimensional column and row spaces of A which is responsible for the particular 
solution of Ax = b. Thus G is simply acting on its domain a{X) considered a subspace of 
y, suitably extended to the whole of Y . That it is indeed a generalized inverse is readily seen 
through the matrix multiplications GAG and AG A that can be verified to reproduce G and A 
respectively. Comparison of Eqs. (|12|) and (|29j) shows that the Moore-Penrose inverse differs 
from ours through the geometrical constraints imposed in its definition, Eqs. H13|) . Of course, 
this results in a more complex inverse (|14)1 as compared to our very simple (|28j) : nevertheless it 
is true that both the inverses satisfy 



E((E(Gmp))' 



/ 1 \ 

10 



V / 

= E{{E{G))^) 

where E{A) is the row-reduced echelon form of A. The canonical simplicity of Eq. (|28|) as 
compared to Eq. ()14() is a general feature that suggests a more natural choice of bases by the 
map a than the orthogonal set imposed by Moore and Penrose. This is to be expected since 
the MP inverse, governed by Eq. ()13() . is a subset of our less restricted inverse described by 
only the first two of (|13j) : more specifically the difference is made clear in Fig. IHa) which shows 
that for any b ^ 7^(^), only Gmp(^±) = as compared to Gib) = 0. This seems to imply that 
introducing extraneous topological considerations into the purely set theoretic inversion process 
may not be a recommended way of inverting, and the simple bases comprising the row and null 
spaces of A and — that are mutually orthogonal just as those of the Moore-Penrose — are 
a better choice for the particular problem Ax = b than the general orthonormal bases that the 
MP inverse introduces. These "good" bases, with respect to which the generalized inverse G 
has a considerably simpler representation, are obtained in a straight forward manner from the 
row-reduced forms of A and . These bases are 

(a) The column space of A is spanned by the columns (1, 3, 2, 2)"'" and (1, 5, 2, 4)'^ of A 
that correspond to the basic columns containing the leading I's in the row-reduced form of A^ 

(b) The null space of Al^ is spanned by the solutions (—2, 0, 1, 0)"*" and (1,-1, 0, 1)"*" of 
the equation A^b = 0, 

(c) The row space of A is spanned by the rows (1, —3, 2, 1, 2) and (3, —9, 10, 2, 9) of A 
corresponding to the non-zero rows in the row-reduced form of A, 

(d) The null space of A is spanned by the solutions (3, 1, 0, 0, 0), (—6, 0, 1, 4, 0), and 
(-2, 0, -3, 0, 4) of the equation Ax = {). ■ 

The main differences between the natural "good" bases and the MP-bases that are responsible 
for the difference in form of the inverses, is that the later have the additional restrictions of being 
orthogonal to each other (recall the orthogonality property of the Q-matrices), and the more 
severe one of basis vectors mapping onto basis vectors according to Axi = aibi, i = 1, • • • ,r, 
where the {xi}^^-^ and are the eigenvectors of Al^ A and AAl^ respectively and 

are the positive square roots of the non-zero eigenvalues of A (or of AA^), with r denoting 
the dimension of the row or column space. This is considered as a serious restriction as the 
linear combination of the basis {6^} that Axi should otherwise have been equal to, allows a 
greater flexibility in the matrix representation of the inverse that shows up in the structure 
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of G. These are, in fact, quite general considerations in the matrix representation of hnear 
operators; thus the basis that diagonahzes an n x n matrix (when this is possible) is not the 
standard "diagonal" orthonormal basis of M", but a problem-dependent, less canonical, basis 
consisting of the n eigenvectors of the matrix. The 0-rows of the inverse of Eq. (|28|) result 
from the 3-dimensional null-space variables X2, X4, and xs, while the 0-columns come from the 
2-dimensional image-space dependency of 63 , 64 on bi and 62 1 that is from the last two zero rows 
of the reduced echelon form ()27() of the augmented matrix. 

We will return to this theme of the generation of a most appropriate problem-dependent 
topology for a given space in the more general context of chaos in Sec. 4.2. 

In concluding this introduction to generalized inverses we note that the inverse G of / comes 
very close to being a right inverse: thus even though AG 7^ I2 its row-reduced form 

/ 1 \ 

10 



V / 

is to be compared with the corresponding less satisfactory 

/I 2 -1 \ 

10 1 



V / 

representation of AGup- 



3. Multifunctional extension of function spaces 

The previous section has considered the solution of ill-posed problems as multifunctions and 
has shown how this solution may be constructed. Here we introduce the multifunction space 
Multi| {X) as the first step toward obtaining a smallest de nse extension Multi f X) of the function 
space Map(X). Multi|(X) is basic to our theory of chaos ISenguota and in the sense 



that a chaotic state of a system can be fully described by such an indeterminate multifunctional 
state. In fact, multifunctions also enter in a natural way in describing the spectrum of nonlinear 
functions that we consider in Section 6; this is required to complete the construction of the 
smallest extension Multi(X) of the function space Map(X). The main tool in obtaining the 
space Multi|(X) from Map(X) is a generalization of the technique of pointwise convergence 
of continuous functions to (discontinuous) functions. In the analysis below, we consider nets 
instead of sequences as the spaces concerned, like the topology of pointwise convergence, may 
not be first countable. Appendix Al. 



3. 1 . Graphical convergence of a net of functions 

Let {X,U) and (y, V) be Hausdorff spaces and {fa)a& : X hea. net of piecewise continuous 
functions, not necessarily with the same domain or range, and suppose that for each a G B there 
is a finite set = {1; 2, • • • P^} such that /~ has Pa functional branches possibly with different 
domains; obviously la is a singleton iff / is a injective. For each a G B, define functions 
{9ai)i&i^ --Y ^ X such that 

fadaifa — fai ^ ~ l-i 2, • • • Pa, 

where /^^ is a basic injective branch of fa on some subset of its domain: Qaifai = Ix on V{f^-), 
faidai = ou T>{gai) for each i ^ la- The use of nets and filters is dictated by the fact that 
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we do not assume X and Y to be first countable. In the application to the theory of dynamical 
systems that follows, X and Y are compact subsets of M when the use of sequences suffice. 

In terms of the residual and cofinal subsets Res(D) and Cof(I])) of a directed set B (Def. 
A 1.7), with X and y in the equations below being taken to belong to the required domains, 
define subsets P_ of X and TZ^ of Y as 

P_ = {x G X: ((/^(x))^gD converges in (F, V))} (30) 
TZ- = {y<^Y: {3i <^ I y){{g^i{y))y(,n cortex ges m {X,U))] (31) 

Thus: 

is the set of points of X on which the values of a given net of functions {fa)a&'B converge 
pointwise in Y . Explicitly, this is the subset of X on which subnets^^ in Map(X, Y) combine to 
form a net of functions that converge pointwise to a limit function F : D_ — > Y . 

TZ- is the set of points of Y on which the values of the nets in X generated by the injective 
branches of {fa)a£D converge pointwise in Y. Explicitly, this is the subset of Y on which subnets 
of injective branches of {fa)aGD in Map(y, X) combine to form a net of functions that converge 
pointwise to a family of limit functions G : TZ^ — > X. Depending on the nature of {fa)aGO^ 
there may be more than one TZ- with a corresponding family of limit functions on each of them. 
To simplify the notation, we will usually let G : TZ- X denote all the limit functions on all 
the sets TZ-. 

If we consider cofinal rather than residual subsets of B then corresponding B+ and can 
be expressed as 

V+ = {x e X : {{fu{x))uGCof{l}) converges in {Y,V))} (32) 

TZ+ = {y eY: {3i e Iv){{gui{y))v&Coi(n) converges in (X,W))}. (33) 

It is to be noted that the conditions P+ = P_ and 7^+ = TZ- are necessary and sufficient for 
the Kuratowski convergence to exist. Since and TZj^ differ from !)_ and TZ- only in having 
cofinal subsets of D replaced by residual ones, and since residual sets are also cofinal, it follows 
that V- C and TZ- C TZj^. The sets V- and TZ- serve for the convergence of a net of 
functions just as P+ and 7^+ are for the convergence of subnets of the nets {adherence). The 
later sets are needed when subsequences are to be considered as sequences in their own right 
as, for example, in dynamical systems theory in the case of w-limit sets. 

As an illustration of these definitions, consider the sequence of injective functions on the 
interval [0, 1] /„(x) = 2"x, for x G [0, 1/2"] , n = 0, 1, 2 • • • . Then B0.2 is the set {0, 1, 2} and 
only Bq is eventual in B. Hence V- is the single point set {0}. On the other hand By is eventual 
in B for all y and TZ- is [0, 1]. 

Definition 3.1. Graphical Convergence of a net of functions. A net of functions 
{fa)aeD ■ {X,U) (X,V) is said to converge graphically if either V- ^ % or TZ- / 0; in this 
case let F : T>- Y and G : TZ- — > X be the entire collection of limit functions. Because of 
the assumed Hausdorffness of X and Y, these limits are well defined. 

The graph of the graphical limit M of the net (fa) ■ {X,U) — > {Y,V) denoted by fa — > M, 
is the subset ofT>- x TZ-that is the union of the graphs of the function F and the multifunction 
G- 

Gh = Gp [J G^- 

where 

GG-={{x,y)eXxY:{y,x)eGGQYxX}. □ 



^'■'A subnet is the generalized uncountable equivalent of a subsequence; for the technical definition, see 
Appendix Al. 



28 



Begin TutoriaW: Graphical Convergence 

The following two examples are basic to the understanding of the graphical convergence of 
functions to multifunctions and were the examples that motivated our search of an acceptable 
technique that did not require vertical portions of limit relations to disappear simply because 
they were non- functions: the disturbing question that needed an answer was how not to math- 
ematically sacrifice these extremely significant physical components of the limiting correspon- 
dences. Furthermore, it appears to be quite plausible to expect a physical interaction between 
two spaces X and y to be a consequence of both the direct interaction represented by / : X — > 1" 
and also the inverse interaction f~:Y^X, and our formulation of pointwise biconvergence is 
a formalization of this idea. Thus the basic examples (1) and (2) below produce multifunctions 
instead of discontinuous functions that would be obtained by the usual pointwise limit. 

Example 3.1. (1) 



fnix) 



-1 < X < 

nx 0<x<l/n : [-1, 1] ^ [0, 1] 

1 l/n<x<l 

gn{y)=y/n: [0, l] ^ [0, 



Then 



F{x) 



-1 < X < 
< X < 1 



on P„ =P+ = [-1,0]|J(0,1] 



G{y) =0 on 7^_ = [0, 1] = TZ^ 



The graphical limit is ([-1, 0], 0) U(0, [0,1])U((0, 

(2) /n(x) = nx for x G [0, l/n] gives gn{y) = y/n : [0, 1] [0,1/n]. Then 



F(x) = on = {0} = 2?H 



G{y) = on 7^_ = [0, 1] = 7^^ 



The graphical hmit is (0, [0, 1]). 



In these examples that we consider to be the prototypes of graphical convergence of functions 
to multifunctions, G{y) = on TZ- because gn{y) for all y £ TZ-. Compare the graphical 
multifunctional limits with the corresponding nHnal pointwise fimrtinnal lirnits characterized by 
discontinuity at x = 0. Two more examples from ISengupta and Ravi ((20001) that illustrate this 
new convergence principle tailored specifically to capture one-to-many relations are shown in 
Fig. [3 which also provides an example in Fig. Efc) of a function whose iterates do not converge 
graphically because in this case both the sets P_ and 7^_are empty. The power of graphical 
convergence in capturing multifunctional limits is further demonstrated by the example of the 
sequence (sinn7rx)J^^ that converges to both 1-integrally and test-functionally, Eqs. ^ and 

It is necessary to understand how the concepts of eventually in and frequently in of Appendix 
A2 apply in examples (a) and (b) of Fig. d In these two examples we have two subsequences 
one each for the even indices and the other for the odd. For a point-to-point functional relation, 
this would mean that the sequence frequents the adherence set adh(x) of the sequence (x„) 
but does not converge anywhere as it is not eventually in every neighbourhood of any point. 
For a multifunctional limit however it is possible, as demonstrated by these examples, for the 
subsequences to be eventually in every neighbourhood of certain subsets common to the eventual 
limiting sets of the subsequences; this intersection of the subsequential limits is now defined to 
be the limit of the original sequence. A similar situation obtains, for example, in the solution of 
simultaneous equations: The solution of the equation anxi -|- 012X2 = bi for one of the variables 
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3 ./ 
/ 

n odd 



(b) 




Figure 7: The graphical hmits are; (a) F(x) 
TZ- = [0,1]. Also G 



1 for < X < 1 



on V- = [0, 1] U(l, 2], and G(y) = 1 on 



for 1 < j; < 2 

1 on 7^+ = [0,3/2] 
1 on 7^+ = [-1/2, 1] ■ 

(b) F{x) = 1 on = {0} and G{y) = on 7^- = {1}. Also F{x) = -1/2, 0, 1, 3/2 respectively on 
D+ = (0,3], {2}, {0}, (0,2) andG(j/) = 0, 0, 2, 3 respectively on 7^+ = (-1/2, 1], [1,3/2), [0,3/2), [-1/2,0). 

(c) For f(x) = —0.05 + x ~ x^, no graphical limit as T)- — = TZ-. 

(d) For f{x) = 0.7 + a; — i^, -F(2;) = a on D_ = [a, c], Gi{y) = a and G2(y) = c on 7?— = ( — oo, a]. Notice how 
the two fixed points and their equivalent images define the converged limit rectangular multi. As in example (1) 
one has !>_ — X>+; also TZ- = TZ+. 
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X2 say with 012 7^ 0, is the set represented by the straight Une X2 = mixi + ci for all xi in its 
domain, while for a different set of constants 021, 022 and 62 the solution is the entirely different 
set X2 = m2Xi + C2, under the assumption that mi 7^ m2 and ci 7^ C2. Thus even though the 
individual equations (subsequences) of the simultaneous set of equations (sequence) may have 
distinct solutions (limits), the solution of the equations is their common point of intersection. 

Considered as sets in XxY, the discussion of convergence of a sequence of graphs /„ : X — > y 
would be incomplete without a mention of the convergence of a sequence of sets under the 
Hausdorff metric that is so basic in the study of fractals. In this case, one talks about the 
convergence of a sequence of compact subsets of the metric space so that the sequences, 
as also the limit points that are the fractals, are compact subsets of ffi". Let /C denote the 
collection of all nonempty compact subsets of M". Then the Hausdorff metric dn between two 
sets on /C is defined to be 

du{E, F) = max{J(S, F), 6{F, E)} E,F £}C, 

where 

6{E,F)=max min||x — y||2 

is 6{E, F) is the non-symmetric 2-norm in M^. The power and utility of the Hausdorff distance 
is best understood in terms of the dilations E + e := Uxes ^ei^) of a subset E of M" by e where 
Df,{x) is a closed ball of radius e at x; physically a dilation of by e is a closed e-neighbourhood 
of E. Then a fundamental prope rty of i s that dn^E, F) < e iff both E C F-\-e and F C E + e 
hold simultaneously which leads iFalconer to the interesting consequence that 



If {Fn)^^i and F are nonempty compact sets, then limji_>oo Fn = F in the Hausdorff metric 
iff En C F + e and F O Fn + £ eventually. Furthermore if (Fn)5^i is a decreasing sequence of 
elements of a filter-hase in M", then the nonempty and compact limit set F is given by 



lim F„ = F = n En. 

n— »oo ' I 



Note that since M" is Hausdorff, the assumed compactness of En ensures that they are also closed 
in M""; F, therefore, is just the adherent set of the filter-base. In the deterministic algorithm 
for the generation of fractals by the so-called iterated function system (IFS) approach, F„, is 
the inverse image by the n*^ iterate of a non-injective function / having a finite number of 
injective branches and converging graphically to a multifunction. Under the conditions stated 
above, the Hausdorff metric ensures convergence of any class of compact subsets in M". It 
appears eminently plausible that our multifunctional graphical convergence on Map(M") implies 
Hausdorff convergence on M"': in fact pointwise biconvergence involves simultaneous convergence 
of image and preimage nets on Y and X respectively. Thus confining ourselves to the simpler 
case of pointwise convergence, if {fa)a£0 is a net of functions in Map{X,Y), then the following 
theorem expresses the link between convergence in Map(X, Y) and in Y. 

Theorem 3.1. A net of functions {fa)aeo converges to a function f in (Ma.p{X, Y),T) in the 
topology of pointwise convergence iff (fa) converges pointwise to f in the sense that fa{x) — > 
/(x) in Y for every x in X. □ 

Proof. Necessity. First consider fa ^ f in (Map(X, y), T). For an open neighbourhood V 
of /(x) in Y with x £ X, let B{x;V) be a local neighbourhood of / in (Map{X, Y),T), see 
Eq. (|78|) in Appendix Al. By assumption of convergence, (fa) must eventually be in B{x; V) 
implying that fa{x) is eventually in V. Hence /q,(x) — > /(x) in Y. 

Sufficiency. Conversely, if fa{x) — > f{x) in Y for every x G X, then for a finite collection of 
points {xi)l^-^ oi X {X may itself be uncountable) and corresponding open sets (Vi)[^^ in Y with 
/(xj) S Vi, let B{{xi)l^^; {Vi)l^i) be an open neighbourhood of /. From the assumed pointwise 
convergence faixi) — > /(xj) in y for i = 1.2. • • • .1, it follows that {fa{xi)) is eventually in Vi 
for every (xi)^^^. Because B is a directed set, the existence of a residual applicable globally 
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for all i = 1, 2, • • • , / is assured leading to the conclusion that fa{xi) G Vi eventually for every 
i = 1,2, ••• , /. Hence S B{{xi)l^'^;{Vi)l^^) eventually; this completes the demonstration 
that fa^f^n (Map(X, y), T), and thus of the proof. ■ 

End Tutorials 



3.2. The Extension Multi of Map{X,Y) 

In this Section we show how the topological treatment of pointwise convergence of functions to 
functions given in Example Al.l of Appendix 1 can be generalized to generate the boundary 
Multi| (X, y) between Map{X,Y) and Multi(X, y); here X and Y are Hausdorff spaces and 
Map(X, y) and Multi(X, Y) are respectively the sets of all functional and non-functional rela- 
tions between X and Y. The generalization we seek defines neighbourhoods of / G Map(X, Y) 
to consist of those functional relations in Multi(X, Y) whose images at any point x X lies not 
only arbitrarily close to f{x) (this generates the usual topology of pointwise convergence 7y of 
Example Al.l) but whose inverse images at y = f{x) € Y contain points arbitrarily close to x. 
Thus the graph of / must not only lie close enough to f{x) at x in V, but must additionally 
be such that f~{y) has at least branch in U about x; thus / is constrained to cling to / as the 
number of points on the graph of / increases with convergence and, unlike in the situation of 
simple pointwise convergence, no gaps in the graph of the limit object is permitted not only, as 
in Example Al.l on the domain of /, but simultaneously on it range too. We call the resulting 
generated topology the topology of pointwise biconvergence on Map (A", y), to be denoted by T. 
Thus for any given integer / > 1, the generalization of Eq. ()78|) gives for i = 1, 2, • • • ,1, the 
open sets of (Map(A, y),T) to be 

Biix,), (Vi); (y,), (Ui)) = {g e Map(A, Y) : 

i9{xi)eVi)Aig-{y,)f]U,^$) , i = 1, 2, • • • , 7}, (34) 

where {Vi)^^^ are as in that example, (yi)[=i G Y, and the corresponding open sets 

{Ui)l^i in X are chosen arbitrarily^'^. A local base at /, for {xi,yi) G Gj, is the set of functions 
of with yi = f{xi) and the collection of all local bases 

Ba = Bi{x,)ti, iV^)tv iy^)tv mti). (35) 

for every choice of a G D, is a base t-S of (Map (A, y), T). Here the directed set B is used as an 
indexing tool because, as pointed out in Example Al.l, the topology of pointwise convergence 
is not first countable. 

In a manner similar to Eq. (|34j) . the open sets of (Multi(A, y), T), where Multi(A, y) are 
multifunctions with only countably many values in Y for every point of X (so that we exclude 
continuous regions from our discussion except for the "vertical lines" of Multi| (A, y)), can be 
defined by 

B{{xi),{yi);{y^),{U,)) = {S G Multi(A,y) : {^{x,){~]Vi / 0) A {r{yi){~]U.i + 0)} (36) 
where 

g-(y) = {xGA:yGg(x)}. 

and (a;i)[=i G T^{%),^i)\=\\{yi)\=x G 'R{'S),{Ui)\=x are chosen as in the above. The topology 
T of Multi(A, y) is generated by the collection of all local bases B^ for every choice of a G D, 
and it is not difficult to see from Eqs. (|5H) and that the restriction T |Map(x,r) of T to 
Map(A, y) is just T. 

^"Equation is essentially the intersection of the pointwise topologies (|7Hll due to / and 
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Henceforth T and T will be denoted by the same symbol T, and convergence in the topology 
of pointwise biconvergence in {Multi{X, Y),T) will be denoted by with the notation being 
derived from Theorem 3.1. 

Definition 3.2. Functionization of a multifunction. A net of functions {fa)aeB in 
Map{X,Y) converges in (Multi(X, y), T), fa ^ M, if it hiconverges pointwise in {Map{X, Y),T*). 
Such a net of functions will be said to be a functionization of M. □ 

Theorem 3.2. Let {fa)aeD be a net of functions in Map{X ,Y) . Then 

fa-^M^ fa^M. □ 

Proof. If (fa) converges graphically to M then either 2?_ or is nonempty; let us assume 
both of them to be so. Then the sequence of functions (fa) converges pointwise to a function 
F on P_ and to functions G on TZ-, and the local basic neighbourhoods of F and G generate 
the topology of pointwise biconvergence. 

Conversely, for pointwise biconvergence on X and Y, TZ^ and P_ must be non-empty. ■ 




Fi gure 8: The power of graphical convergence, illustrated for Example 3.1 (1), shows a local neighbourhood of 
the functions x and 2x in figures (a) and (b) at the four points {xi)f^i with corresponding neighbourhoods (Ui)i^i 
and {Vi)f-i at {xi,f{xi)) in R in the X and Y directions respectively, see Eqs. 1341 and (1781 for the notations. 
In (a) is shown a function g in a pointwise neighbourhood of / determined by the open sets Vi, while (b) shows 
g in a graphical neighbourhood of / due to both Ui and Vi. A comparison of these figures demonstrates how 
the graphical neighbourhood forces functions close to / to remain closer to it than if they were in its pointwise 
neighbourhood. This property is clearly visible in (a) where g, if it were to be in a graphical neighbourhood 
of /, would be more faithful to it by having to be also in U2 and U4. Thus in this case not only the must the 
images f{xij) f{xi) as the Vi decrease, but so also must the preimages Xij Xi with shrinking Ui. It is this 
simultaneous convergence of both images and preimages at every x that makes graphical convergence a natural 
candidate for multifunctional convergence of functions. 



Observe that the boundary of Map(X, Y) in the topology of pointwise biconvergence is a 
"line parallel to the Y-axis". We denote this closure of Map(X, Y) as 

Definition 3.3. Multi| ((X, y), T) = Cl(Map((X, y), T)). □ 

The sense in which Multi| {X, Y) is the smallest closed topological extension of M = Map(X, Y) 
is the following, refer Thm. Al.4 and its proof. Let (M, Tq) be a topological space and suppose 
that 

M = M[J{in} 
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is obtained by adjoining an extra point to M; here M = Map(X, Y) and in G C1(M) is the 
multifunctional limit in M = Multi|(X, y). Treat all open sets of M generated by local bases 
of the type (jSSl) with finite intersection property as a filter-base pB on X that induces a filter 
on M (by forming supersets of all elements of pB; see Appendix Al) and thereby the filter-base 

^ = {B = B [j{m} -.B € fB} 

on M; this filter-base at m can also be obtained independently from Eq. H36|) . Obviously pB is 
an extension of pB on M and pB is the filter induced on M by pB. We may also consider the 
filter-base to be a topological base on M that defines a coarser topology T on M (through all 
unions of members of pB) and hence the topology 

f = {G = G\J{m}:GeT} 

on M to be the topology associated with J^. A finer topology on M may be obtained by adding 
to T all the discarded elements of Tq that do not satisfy FIP. It is clear that fh is on the 
boundary of M because every neighbourhood of fh intersects M by construction; thus (M, T) 
is dense in {M,T) which is the required topological extension of {M,T). 

In the present case, a filter-base at / G M.ap{X,Y) is the neighbourhood system pBf at / 
given by decreasing sequences of neighbourhoods (Vfc) and (Uk) of f{x) and x respectively, and 
the filter is the neighbourhood filter A//IJG where G GMulti| (X, y). We shall present an 
alternate, and perhaps more intuitively appealing, description of graphical convergence based 
on the adherence set of a filter on Sec. 4.1. 

As more serious examples of the graphical convergence of a net of functions to multifunction 
than those considered above, Fig. IHl shows the first four iterates of the tent map 

i 2x < X < 1/2 ^ , 

t(x) = i - ' it^=t). 

^ ^ \ 2(1 -x) 1/2 < X < 1 ^ ' 

defined on [0.1] and the sine map /„ = | sin(2"'~^7rx)|, n = 1, • • • ,4 with domain [0, 1]. 

These examples illustrate the important generalization that periodic points may be replaced 
by the more general equivalence classes where a sequence of functions converges graphically; 
this generalization based on the ill-posed interpretation of dynamical systems is significant for 
non-iterative systems as in second example above. The equivalence classes of the tent map for 
its two fixed points and 2/3 generated by the first 4 iterates are 

r 113 15 3 7 
[0]4 = iO,-, 1 



4' 8' 2' 8' 4' 



1 1 3 1 5 3 7 , 

c, - T c, - =F c, - =F c, - =F c, - =F c, - =F c, - =F c, 1 — c 
'8 4 8 2 8 4 8 



4 

where c = 1/24. If the moduli of the slopes of the graphs passing through these equivalent fixed 
points are greater than 1 then the graphs converge to multifunctions and when these slopes 
are less than 1 the corresponding graphs converge to constant functions. It is to be noted that 
the number of equivalent fixed points in a class increases with the number of iterations k as 
2^-1 _j_ increase in the degree of ill-posedness is typical of discrete chaotic systems and 

can be regarded as a paradigm of chaos generated by the convergence of a family of functions. 

The m**^ iterate t"^ of the tent map has 2™ fixed points corresponding to the 2"^ injective 
branches of t"^ 

^ J = l,3,--- ,(2--l^ 



t {xjyij^ — Xjjij, j — 1, 2, • • • ,2 
— - — , j = 2,4,--- ,2™, 
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Figure 9: The first four iterates of (a) tent and (b) | sin(2"~^7ra;)| maps siiow tlie formal similarity of the 
dynamics of these functions. It should be noted, as shown in Fig. |7| that although sm{nnx)^^i fails to converge 
at any point other than and 1, the subsequence sin(2''~^7ra:)^i does converge graphically on a set dense in 
[0,1]. 

Let Xm be the collection of these 2™ fixed points (thus Xi = {0,2/3}), and denote by [X^] 
the set of the equivalent points, one coming from each of the injective branches, for each of the 
fixed points: thus 

V.= [X,] = {[0],[2/3]} 

[X,] = {[0], [2/5], [2/3], [4/5]} 

and = nm=i[^»"] ^ nonempty countable set dense in X at each of which the graphs of 
the sequence (t™) converge to a multifunction. New sets [Xn] will be formed by subsequences 
of the higher iterates for m = in with i = 1, 2, • • • where these subsequences remain fixed. 
For example, the fixed points 2/5 and 4/5 produced respectively by the second and fourth 
injective branches of t^, are also fixed for the seventh and thirteenth branches of t^. For the 
shift map 2x mod(l) on [0,1], P_ = {[0],[1]} where [0] = {^Z=l{{^ - l)/2"': i = 1,2,--- ,2™} 
and [l]=n^=i{i/2™:i = l,2,... ,2™}. 

It is useful to compare the graphical convergence of (sin(7rnx))^]^ to [0, 1] at and to 
at 1 with the usual integral and test-functional convergences to 0; note that the point 1/2, 
for example, belongs to P+and not to P_ = {0, 1} because it is frequented by even n only. 
However for the subsequence (/2™-i)m6Z+, 1/2 is in because if the graph of /2m-i passes 
through (1/2,0) for some m, then so do the graphs for all higher values . Therefore [0] = 
nm=i{V2'"~"'^ '■ i = 0, 1,-- - ,2"^~^} is the equivalence class of (/2™-i)m=i ^^"^ this sequence 
converges to [—1, 1] on this set. Thus our extension Multi(X) is distinct from the distributional 
extension of function spaces with respect to test functions, and is able to correctly generate the 
pathological behaviour of the limits that are so crucially vital in producing chaos. 



4. Discrete chaotic systems are maximally ill-posed 

The above ideas apply to the development of a criterion for chaos in discrete dynamical systems 
that is based on the limiting behaviour of the graphs of a sequence of functions (/„) on X, rather 
than on the values that the sequence generates as is customary. For the development of the 
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maximality of ill-posedness criterion of chaos, we need to refresh ourselves with the following 
preliminaries. 

Resume Tutorials: Axiom of Choice and Zorn's Lemma 

Let us recall from the first part of this Tutorial that for nonempty subsets {Aa)a^o of a nonempty 
set X, the Axiom of Choice ensures the existence of a set A such that A f] A^ consists of a single 
element for every a. The choice axiom has far reaching consequences and a few equivalent 
statements, one of which the Zorn's lemma that will be used immediately in the following, is 
the topic of this resumed Tutorial. The beauty of the Axiom, and of its equivalents, is that 
they assert the existence of mathematical objects that, in general, cannot be demonstrated and 
it is often believed that Zorn's lemma is one of the most powerful tools that a mathematician 
has available to him that is "almost indispensable in many parts of modern pure mathematics" 
with significant applications in nearly all branches of contemporary mathematics. This "lemma" 
talks about maximal (as distinct from "maximum") elements of a partially ordered set, a set in 
which some notion of xi "preceding" X2 for two elements of the set has been defined. 

A relation ^ on a set X is said to be a partial order (or simply an order) if it is (compare 
with the properties (ER1)-(ER3) of an equivalence relation, Tutoriall) 

(ORl) Reflexive, that is (Vx £ X){x ^x). 

(0R2) Antisymmetric: (Vx, y G X){x :^ y A y ^ x =^ x = y). 

(0R3) Transitive, that is (Vx, y, z G X){x :< y Ay ^ z =^ x ^ z). Any notion of order on a 
set X in the sense of one element of X preceding another should possess at least this property. 

The relation is a preorder ^ if it is only reflexive and transitive, that is if only (ORl) and 
(0R3) are true. If the hypothesis of (0R2) is also satisfied by a preorder, then this ;^ induces 
an equivalence relation ~ on X according to (x y) A (y x) <^ x y that evidently is 
actually a partial order iffx~y44>x = 2/. For any element [x] G X/ ~ of the induced quotient 
space, let < denote the generated order in X/ ~ so that 

x;^y [x] < [y]; 

then < is a partial order on X/ ~. If every two element of X are comparable, in the sense that 
either xi ^ X2 or X2 ^ xi for all xi, X2 G X, then X is said to be a totally ordered set or a chain. 
A totally ordered subset (C, ^) of a partially ordered set {X, ^) with the ordering induced from 
X, is known as a chain in X if 

C = {xGX: (VcGX)(c^xVx^c)}. (37) 

The most important class of chains that we are concerned with in this work is that on the 
subsets Vi^X) of a set {X,C) under the inclusion order; Eq. (|37|) . as we shall see in what 
follows, defines a family of chains of nested subsets in V{X). Thus while the relation ^ in 
Z defined by ui ^ n2 | < | I with ni,?i2 G Z preorders Z, it is not a partial order 

because although —n ^ n and n ^ —n for any n € Z, it is does not follow that —n = n. A 
common example of partial order on a set of sets, for example on the power set V{X) of a set X 
(see footnote ??), is the inclusion relation C: the ordered set X = {V{{x,y, z}),C) is partially 
ordered but not totally ordered because, for example, {x,y} ^ {y,x}, or {x} is not comparable 
to {y} unless x = y; however C = {{0, {x}, {x, y}} does represent one of the many possible 
chains of X. Another useful example of partial order is the following: Let X and {Y, <) be sets 
with < ordering Y, and consider f,g€ Map(X, y) with T){f),T)[g) C X. Then 

(P(/)CP(5))(/ = 5|^(^)) ^ f^g 
{V{f)=V{g)){n{f)<zn{g)) ^ f<g (38) 
(Vx G V{f) = V{g)) {fix) < g{x)) ^ f^g 

define partial orders on Map(X, y). In the last case, the order is not total because any two 
functions whose graphs cross at some point in their common domain cannot be ordered by the 
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given relation, while in the first any / whose graph does not coincide with that of g on the 
common domain is not comparable to it by this relation. 

Let (X, ^) be a partially ordered set and let A be a subset of X. An element 0+ G ^) 
is said to be a maximal element of A with respect to ^ if 

(Va G {A, ^))(a+ ^ a) ^ a = a+, (39) 

that is iff there is no a G A with a ^ and a >- a+^^. Expressed otherwise, this implies that 
an element a+ of a subset A C (X, ^) is maximal in {A, ^) iff it is true that 

(a ^ a+ G A) (for every a G (vl, ^) comparable to a+); (40) 

thus a+ in A is a maximal element of A iff it is strictly greater than every other comparable 
element of A. This of course does not mean that each element o of A satisfies a ^ a+ because 
every pair of elements of a partially ordered set need not be comparable: in a totally ordered 
set there can be at most one maximal element. In comparison, an element a^o of a subset 
A Q {X, :<) is the unique maximum (largest, greatest, last) element of A iff 

(a ^ Ooo G A) (for every a G (A, ^)), (41) 

implying that a^o is the element of A that is strictly larger than every other element of A. As in 
the case of the maximal, although this also does not require all elements of A to be comparable 
to each other, it does require a^o to be larger than every element of A. The dual concepts of 
minimal and minimum can be similarly defined by essentially reversing the roles of a and b in 
relational expressions like a <b. 

The last concept needed to formalize Zorn's lemma is that of an upper bound: For a subset 
{A, ^) of a partially ordered set {X, ^), an element u of X is an upper bound of A in X iff 

(a ^ n G (X, <)) (for every a G {A, <)) (42) 

which requires the upper bound u to be larger than all members of A, with the corresponding 
lower bounds of A being defined in a similar manner. Of course, it is again not necessary that 
the elements of A be comparable to each other, and it should be clear from Eqs. (|41j) and (|42|) 
that when an upper bound of a set is in the set itself, then it is the maximum element of the set. 
If the upper (lower) bounds of a subset {A, ■<) of a set (X, <) has a least (greatest) element, 
then this smallest upper bound (largest lower bound) is called the least upper bound [greatest 
lower bound) or supremum (infimum) of A in X. Combining Eqs. (|41() and (|42() then yields 

sup A = {a^ G flA ■ o-^ :< uy u £ {^Ia, ^)} 

^ (43) 
inf^ = {_a G Aa: / ^ V / G (Aa,^)} ^ 

where ^Ia = { u £ X : (\/ a £ A){a ^ u)} and A^ = {/ G X : (\/ a £ A){1 ^ a)} are the sets of 
all upper and lower bounds of vl in X. Equation ()43() may be expressed in the equivalent but 
more transparent form as 



sup A <^=^ (a £ A ^ a ^ a^) A (cq -< a«- =^ oq ~< b ^ for some b £ A) 



X 



a = inf A <^=^ (a £ A ^ _»a :< a) A {^a -< ai ^ _»a ^ 6 -< oi for some 6 G ^) 



(44) 



X 



to imply that (->fl) is the upper (lower) bound of A in X which precedes (succeeds) every 
other upper (lower) bound of A in X. Notice that uniqueness in the definitions above is a direct 
consequence of the uniqueness of greatest and least elements of a set. It must be noted that 

^^If :< is an order relation in X then the strict relation < in X corresponding to ^. given hy x < y <=> 
y) A {x ^ y), is not an order relation because unlike ^, ^ is not reflexive even though it is both 
transitive and asymmetric. 
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whereas maximal and maximum are properties of the particular subset and have nothing to do 
with anything outside it, upper and lower bounds of a set are defined only with respect to a 
superset that may contain it. 

The following example, beside being useful in Zorn's lemma, is also of great significance in 
fixing some of the basic ideas needed in our future arguments involving classes of sets ordered 
by the inclusion relation. 

Example 4.1. Let X = 'P{{a,b,c}) be ordered by the inclusion relation C. The subset 
A = 'P{{a, b, c}) — {a, b, c} has three maximals {a, b}, {b, c} and {c, a} but no maximum as there 
is no Aoo G A satisfying A ^ for every A € A, while V{{a, b, c}) — the three minimals 
{a}, {b}, and {c} but no minimum. This shows that a subset of a partially ordered set may 
have many maximals (minimals) without possessing a maximum (minimum), but a subset has 
a maximum (minimum) iff this is its unique maximal (minimal). If ^ = {{a,b},{a,c}}, then 
every subset of the intersection of the elements of A, namely {a} and 0, are lower bounds of 
A, and all supersets in X of the union of its elements — which in this case is just {a, b, c} — 
are its upper bounds. Notice that while the maximal (minimal) and maximum (minimum) are 
elements of A, upper and lower bounds need not be contained in their sets. In this class {X, C) 
of subsets of a set X, is a maximal element of X iff X+ is not contained in any other subset 
of X, while Xoo is a maximum of X iff X^q contains every other subset of X. 

Let A := {Aa S X}a£D be a nonempty subclass of {X,C), and suppose that both [J Aa 
and fl^ 

a are elements of X. Since each A^ is C-less than U^a; it follows that (J Aq, is an 
upper bound of A; this is also be the smallest of all such bounds because if U is any other 
upper bound then every A^ must precede U by Eq. and therefore so must |J A^ (because 
the union of a class of subsets of a set is the smallest that contain each member of the class: 
Aa ^ U^a ^ U for subsets {Aa) and U of X). Analogously, since is ^-less than 

each Aa it is a lower bound of A] that it is the greatest of all the lower bounds L m X follows 
because the intersection of a class of subsets is the largest that is contained in each of the 
subsets: L C Aa =^ L C Aa for subsets L and (Aa) of X. Hence the supremum and infimum 
of A in {X,C^) given by 

A^= sup A = \ \ A and ^A = inf ^ = H ^ (45) 

i^'Q La 

are both elements of {X, C). Intuitively, an upper (respectively, lower) bound of ^ in A:" is any 
subset of X that contains (respectively, is contained in) every member of A. ■ 

The statement of Zorn's lemma and its proof can now be completed in three stages as 
follows. For Theorem 4.1 below that constitutes the most significant technical first stage, let g 
be a function on {X, ^) that assigns to every x £ X an immediate successor y £ X such that 

l\f[{x) = {yyx: ^x^,£X satisfying x -< -< y} 

are all the successors of x in X with no element of X lying strictly between x and y. Select a 
representative of M(x) by a choice function /c such that 

g{x) = /c(M(x)) G M(x) 

is an immediate successor of x chosen from the many possible in the set M(x). The basic idea 
in the proof of the first of the three-parts is to express the existence of a maximal element of 
a partially ordered set X in terms of the existence of a fixed point in the set, which follows as 
a contradiction of the assumed hypothesis that every point in X has an immediate successor. 
Our basic application of immediate successors in the following will be to classes X C {V{X), C) 
of subsets of a set X ordered by inclusion. In this case for any A £ X, the function g can be 
taken to be the superset 

g{A)=A[jfc{9{A)), wheve 9{A) = {x £ X - A: A[j{x} £ X} (46) 
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of A. Repeated application of 5 to ^ then generates a principal filter, and hence an associated 
sequence, based at A. 

Theorem 4.1. Let {X, ^) he a partially ordered set that satisEes 

(STl) There is a smallest element xq of X which has no immediate predecessor in X. 
(ST2) If C X is a totally ordered subset in X, then c* = sup^ C is in X . 

Then there exists a maximal element x+ of X which has no immediate successor in X. □ 
Proof. Let T C (X, ^) be a subset of X. If the conclusion of the theorem is false then the 
alternative 

(ST3) Every element x G T has an immediate successor g{x) in T^^ 

leads, as shown below, to a contradiction that can be resolved only by the conclusion of the 
theorem. A subset T of (X, <) satisfying conditions (STl) — (ST3) is sometimes known as an 
g-tower or an g-sequence: an obvious example of a tower is {X, <) itself. If 

^T = € T: T is an xo — tower} 

is the {V{X), C)-infimum of the class T of all sequential towers of (X, <), we show that this 
smallest sequential tower is infact a sequential totally ordered chain in (X, <) built from xq by 
the ^(-function. Let the subset 

Ct = {c E X: (Vt G ^r)(t ^ cVc^ t)} C X (47) 

of X be an (7-chain in in the sense that (cf. Eq. (jSZ|)) it is that subset of X each of whose 
elements is comparable with some element of _>T. The conditions (STl) — (ST3) for Ct can be 
verified as follows to demonstrate that Ct is an 5-tower. 

(1) Xo G Ct, because it is less than each x € ->T. 

(2) Let = sup Y C't be the supremum of the chain Ct in X so that by (ST2), G X. 
Let t S ^r. If there is some c E Ct such that t ^ c, then surely t ^ c^. Else, c ^ t for every 
c G Ct shows that ^ t because is the smallest of all the upper bounds t of Ct- Therefore 

e Ct. 

(3) In order to show that g{c) G C whenever c € C it needs to verified that for all t G _>T, 
either t < c ^ t < g{c) or c ^ t =^ g{c) ^ t. As the former is clearly obvious, we investigate the 
later as follows; note that g{t) G by (ST3). The first step is to show that the subset 

Cg = {te ^T: (Vc G CT)(t <cy g{c) < t)] (48) 

of ^T, which is a chain in X (observe the inverse roles of t and c here as compared to that in 
Eq. (dZI)), is a tower: Let t^ be the supremum of Cg and take c G C. If there is some t £ Cg for 
which g{c) ^ t, then clearly g{c) ^ t^. Else, t ^ x for each t £ Cg shows that t^ ^ c because 
t^ is the smallest of all the upper bounds c of Cg. Hence t^ £ Cg. 

Property (ST3) for Cg follows from a small yet significant modification of the above argu- 
ments in which the immediate successors g{t) oi t E Cg formally replaces the supremum t^ 
of Cg. Thus given a c G C, if there is some t £ Cg for which g{c) ^ t then g{c) -< g{t); this 
combined with (c = t) ^ (fi'(c) = dit)) yields g{c) ^ g{t). On the other hand, t ^ c for every 
t £ Cg requires g{t) ^ c as otherwise (t ^ c) ^ (c ^ S'(^)) would, from the resulting consequence 
t -< c -< g{t), contradict the assumed hypothesis that g{t) is the immediate successor of t. Hence, 
Cg is a 5f-tower in X. 

To complete the proof that g{c) £ Ct, and thereby the argument that Ct is a tower, we 
first note that as ^T is the smallest tower and Cg is built from it, Cg = ^T must infact be —,T 

^^This makes T, and hence X, inductively defined infinite sets. It should be realized that (ST3) does 
not mean that every member of T is obtained from g, but only ensures that the immediate successor of 
any clement of T is also in T. The infimum of these towers satisfies the additional property of being 
totally ordered (and is therefore essentially a sequence or net) in {X, ^) to which (ST2) can be applied. 
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itself. From Eq. H48|) therefore, for every t G _>T either t ^ g{c) or g{c) ^ t, so that g{c) G Ct 
whenever c E Ct- This concludes the proof that Ct is actually the tower in X. 
From (ST2), the implication of the chain Ct 

Ct = -.T = Cg (49) 

being the minimal tower _>r is that the supremum of the totally ordered in its own 
tower (as distinct from in the tower X: recall that is a subset of X) must be contained in 
itself, that is 

sup(Ct) =t^G QX. (50) 
Ct 

This however leads to the contradiction from (ST3) that g{t^) be an element of ^T, unless of 
course 

g{t^)=t^, (51) 

which because of H49|) may also be expressed equivalently as g{c^) = € Ct- As the sequential 
totally ordered set -^T is a subset of X, Eq. (48) implies that t^ is a maximal element of X 
which allows (ST3) to be replaced by the remarkable inverse criterion that 

(ST3') If X E X and w precedes w < x, then w ^ X 

that is obviously false for a general tower T. In fact, it follows directly from Eq. (|39() that under 
(ST3') any & X is a maximal element of X iff it is a fixed point of g as given by Eq. (|5H) . 
This proves the theorem and also demonstrates how, starting from a minimum element of a 
partially ordered set X, (ST3) can be used to generate inductively a totally ordered sequential 
subset of X leading to a maximal = S (X, ^) that is a fixed point of the generating 
function g whenever the supremum t^ of the chain is in X. ■ 

Remarks. The proof of this theorem, despite its apparent length and technically involved 
character, carries the highly significant underlying message that 

Any inductive sequential g- construction of an infinite chained tower Ct starting with a 
smallest element xq G (X, :<) such that a supremum of the g-generated sequential 
chain Ct in its own tower is contained in itself, must necessarily terminate with a fixed 
point relation of the type (|51|) with respect to the supremum. Note from Eqs. Ii5(jl\) 
and i51\) that the role of (ST2) applied to a fully ordered tower is the identification 
of the maximal of the tower — which depends only the tower and has nothing to do 
with anything outside it — with its supremum that depends both on the tower and its 
complement. 

Thus although purely set-theoretic in nature, the filter-base associated with a sequentially totally 
ordered set may be interpreted to lead to the usual notions of adherence and convergence of 
filters and thereby of a generated topology for (X, ^), see Appendix Al and Example Al.3. 
This very significant apparent inter-relation between topologies, filters and orderings will form 
the basis of our approach to the condition of maximal ill-posedness for chaos. 

In the second stage of the three stage programme leading to Zorn's lemma, the tower Theo- 
rem 4.1 and the comments of the preceding paragraph are applied at one higher level to a very 
special class of the power set of a set, the class of all the chains of a partially ordered set, to 
directly lead to the physically significant 

Theorem 4.2. HausdorfF Maximal Principle. Every partially ordered set {X, ^) has a 

maximal totally ordered subset.^^ □ 

^^Recall that this means that if there is a totally ordered chain C in {X, ^) that succeeds C+, then C 
must be C+ so that no chain in X can be strictly larger than C+ . The notation adopted here and below is 
the following: li X = {x, y} is a non-empty set, then X := V{X) = {A: A C X} = {0, {x}, {y}, {x, y}} 
is the set of subsets of X, and X := V^{X) ~ {A : A C X}, the set of all subsets of X, consists 
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Proof. Here the base level is 

X = {C £ V{X) : C is a chain in (X, ^)} C V{X) (52) 

be the set of all the totally ordered subsets of {X,<). Since X \s a collection of (sub)sets of 
X, we order it by the inclusion relation on X and use the tower Theorem to demonstrate that 
C) has a maximal element C^, which by the definition ol X , is the required maximal chain 
in [X,<). 

Let C be a chain in X of the chains in {X,<). In order to apply the tower Theorem to 
{X, C) we need to verify hypothesis (ST2) that the smallest 

C7, = supC =\JC (53) 

of the possible upper bounds of C (see Eq. (|45|) ) is a chain of (X, <). Indeed, if xi, 2:2 G X are 
two points of Csup with xi € Ci and X2 € C2, then from the C-comparability of Ci and C2 we 
may choose xi, X2 G Ci 5 C2, say. Thus xi and X2 are ^-comparable as Ci is a chain in (X, ^); 
C^, G is therefore a chain in (X, ^) which establishes that the supremum of a chain of (A', C) 
is a chain in (X, <). 

The tower Theorem 4.1 can now applied to {X, C) with Co as its smallest element to con- 
struct a (^-sequentially towered fully ordered subset of X consisting of chains in X 

Ct = {Ci G V{X) : Ci C Cj for i < j G N} = _»r C V{X) 

of (^, ^) — consisting of the common elements of all ^-sequential towers T G T of (A^, C) — 
that infact is a principal filter base of chained subsets of (X, <) at Cq. The supremum (chain 
in X) C<_ of Ct in Ct must now satisfy, by Thm. 4.1, the fixed point (7-chain of X 

sup(Ct) = C^ = g{C^) G Ct C p(X), 

Ct 

where the chain 5(C) = CU/c(S(C) - C) with g(C) = {x G X - C : C[_\{x} G ;f}, is an 
immediate successor of C obtained by choosing one point x = /c(S(C') — C) from the many 
possible in S(C') — C such that the resulting g{C) = CU{x} is a strict successor of the chain C 
with no others lying between it and C. Note that C<_ G {X ,C^) is only one of the many maximal 
fully ordered subsets possible in (X, ■<). ■ 

With the assurance of the existence of a maximal chain C^ among all fully ordered subsets 
of a partially ordered set (X, ^), the arguments are completed by returning to the basic level 
of X. 

Theorem 4.3. Zorn's Lemma. Let (X, <) be a partially ordered set such that every totally 
ordered subset of X has an upper bound in X. Then X has at least one maximal element with 
respect to its order. □ 

Proof. The proof of this final part is a mere application of the Hausdorff Maximal Principle 
on the existence of a maximal chain C^ in X to the hypothesis of this theorem that has an 
upper bound u in X that quickly leads to the identification of this bound as a maximal element 
x+ of X. Indeed, if there is an element v G X that is comparable to u and v u, then v cannot 
be in C^ as it is necessary for every x G C^ to satisfy x ^ u. Clearly then C<_ U{^} is a chain 
in (X, :<) bigger than C^ which contradicts the assumed maximality of C<_ among the chains 
of X. ■ 

The sequence of steps leading to Zorn's Lemma, and thence to the maximal of a partially 
ordered set, is summarised in Fig. E3 

of the 16 elements 0, {0}, {{x}}, {{y}}, {{x,y}}, {{0},M}, {{0},{y}}, {{0}, {a;, y}}, {{x},{y}}, 
{{x}, {x, y}}, {{y}, {x, y}}, {{0}, {x}, {y}}, {{0}, {x}, {x, y}}, {{0}, {y}, {x, y}}, {{x}, {y}, {x, y}}, and 
X: an element of 'P^{X) is a subset of 'P{X), any element of which is a subset of X. Thus 
if C = {0,1,2} is a chain in (X = {0, 1,2},<), then C = {{0}, {0, 1}, {0, 1, 2}} C V{X) and 
C = {{{0}}, {{0}, {0, 1}}, {{0}, {0, 1}, {0, 1, 2}}} C V^iX) represent chains in (P(X), C) and (V^iX), C) 
respectively . 
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Figure 10: Application of Zorn's Lemma to {X, :<). Starting with a partially ordered set {X, construct: 

(a) The one-level higher subset X — {C G ViX) : C is a chain in {X, :<)} of 'P{X) consisting of all the totally 
ordered subsets of {X, 

(b) The smallest common (/-sequential totally ordered towered chain Ct = {Ci £ VIX) : d Cj for i < j} C 
'P{X) of all sequential g-towers of X by Thm. 4.1, which infact is a principal filter base of totally ordered subsets 
of (^i ^) a-t the smallest element Co- 

(c) Apply Hausdorff Maximal Principle to [X, C) to get the subset suP(,^(Ct) = = g{C^) G Ct ^ 7-'(A) of 
[X, ^) as the supremum of [X, C) in Ct- The identification of this supremum as a maximal element of [X, C) is 
a consequence of (ST2) and Eqs. I|5()|l . that actually puts the supremum into X itself. 

By returning to the original level (X, <) 

(d) Zorn's Lemma finally yields the required maximal element u G as an upper bound of the maximal totally 
ordered subset (C^, <) of {X, <). 



The dashed segment denotes the higher Hausdorff (A", C) level leading to the base {X, ^) Zorn level. 
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The three examples below of the application of Zorn's Lemma clearly reflect the increasing 
complexity of the problem considered, with the maximals a point, a subset, and a set of subsets 
of X, so that these are elements of X, V{X), and 'P^(X) respectively. 

Example 4.2. (1) Let X = ({a, 6, c}, ^) be a three-point base-level ground set ordered lexico- 
graphically, that is a ^ 6 ^ c. A chain C of the partially ordered Hausdorff-level set X consisting 
of subsets of X given by Eq. (|^^ is, for example, {{a}, {a, h}} and the six (^-sequential chained 
towers 

Ci = {0, {a}, {a, 6}, {a, h, c}}, C2 = {0, {a}, {a, c}, {a, 6, c}} 
C3 = {0, {6}, {a, 6}, {a, 6, c}}, C4 = {0, {6}, {6, c}, {a, 6, c}} 
C5 = {0, {c}, {a, c}, {a, 6, c}}, Cg = {0, {c}, {6, c}, {a, 6, c}} 

built from the smallest element corresponding to the six distinct ways of reaching {a, 6, c} 
from along the sides of the cube marked on the figure with solid lines, all belong to X] see 
Fig. Ilir b). An example of a tower in {X , C) which is not a chain is 

T = {0, {a}, {6}, {c}, {a, 6}, {a, c}, {6, c}, {a, 6, c}}. 

Hence the common infimum towered chained subset 

CT = {0,{a,6,c}} = ^TCP(X) 

of A", with 

sup(Ct) = C^ = {a, 6, c} = g{C^) G Ct C 

Ct 

the only maximal element of V{X). Zorn's Lemma now assures the existence of a maximal 
element of c £ X. Observe how the maximal element of (X, <) is obtained by going one level 
higher to X at the Hausdorff stage and returning to the base level X at Zorn, see Fig. EH for a 
schematic summary of this sequence of steps. 

(2) Basis of a vector space. A linearly independent set of vectors in a vector space X that 
spans the space is known as the Hamel basis of X. To prove the existence of a Hamel basis in 
a vector space, Zorn's lemma is invoked as follows. 

The ground base level of the linearly independent subsets of X 

X = G r{X) : Span({x,j/=i) = (aj)^^, = VJ > 1} C r{X)), 

with Span{{xi.}j^^) := J2j=i'^j^ij^ such that no x G ^ can be expressed as a linear com- 
bination of the elements oi X — {x}. X clearly has a smallest element, say {xj^}, for some 
non-zero £ X. Let the higher Hausdorff level 

X = {C £ V^{X) : C is a chain in {X, C)} C V^{X) 

collection of the chains 

of X comprising linearly independent subsets of X be (7-built from the smallest {xj^ }. Any chain 
£ of j£ is bounded above by the union = sup^^ <t = IJceC ^ which is a chain in X containing 
{xj^}, thereby verifying (ST2) for X. Application of the tower theorem to X implies that the 
chain 

e:T = {c,,,Ci,,--- ,Ci^,---}= -.iQV^x) 

in X of chains of is a (7-sequential fully ordered towered subset of {X, C) consisting of the 
common elements of all (7-sequential towers of {X, C), that infact is a chained principal ultrafilter 
on (V{X),C) generated by the filter-base {{{xij}}} at {xj^}, where 
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(a) 





(b) 



Fi gure 11* Tree diagrams of two partially ordered sets where two points are connected by a line iff they are 
comparable to each other, with the solid lines linking immediate neighbours and the dashed, dotted and dashed- 
dotted lines denoting second, third and fourth generation orderings according to the principle of transitivity of 
the order relation. There are 8x2 chains of (a) and 7 chains of (b) starting from respective smallest elements 
with the immediate successor chains shown in solid lines. The 17 point set X — {0, 1, 2, • ■ ■ , 15, 16} in (a) has 
two maximals but no maximum, while in (b) there is a single maximum of V{{a,b,c}), and three maximals 
without any maximum for P({a, b, c}) — {a, b, c}. In (a), let A — {1, 3, 4, 7, 9, 10, 15}, B = {1, 3, 4, 6, 7, 13, 15}, 
C — {1, 3, 4, 10, 11, 16} and D = {1, 3, 4}. The upper bounds of D in A are 7, 10 and 15 without any supremum 
(as there is no smallest element of {7, 10, 15}), and the upper bounds of _D in _B are 7 and 15 with supg(_D) = 7, 
while sup^(_D) — 10. Finally the maximal, maximum, and the supremum in A of {1,3,4,7} are all the same 
illustrating how the supremum of a set can belong to itself. Observe how the supremum and upper bound of a 
set are with reference to its complement in contrast with the maximum and maximal that have nothing to do 
with anything outside the set. 
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for some n G N is an example of non-chained ^-tower whenever (Cj^.)^^ is neither contained 
in nor contains any member of the (Cj^)^^ chain. Hausdorff's chain theorem now yields the 
fixed-point (7-chain £ X oi X 

sup{<Lt) =C^ = {{xi^},{xi^,Xi^},{xi-,,Xi2,XiJ,-- • } = 9{C^) G (^t ^ V'^{X) 

as a maximal totally ordered principal filter on X that is generated by the filter-base {{a^ii}} at 
Xi-^, whose supremum B = {xi^^Xi^, • • • } G V{X) is, by Zorn's lemma, a maximal element of 
the base level X . This maximal linearly independent subset of X is the required Hamel basis 
for X: Indeed, if the span of B is not the whole of X, then Span(i?) |J x, with x ^ Span(i?) 
would, by definition, be a linearly independent set of X strictly larger than B, contradicting 
the assumed maximality of the later. It needs to be understood that since the infinite basis 
cannot be classified as being linearly independent, we have here an important example of the 
supremum of the maximal chained set not belonging to the set even though this criterion was 
explicitly used in the construction process according to (ST2) and (ST3). 

Compared to this purely algebraic concept of basis in a vector space, is the Schauder basis in a 
normed space which combines topological structure with the linear in the form of convergence: If 
a normed vector space contains a sequence (ej)jgz+ with the property that for every x £ X there 
is an unique sequence of scalars (aj)jgz+ such that the remainder || x — (aiei-|-a2e2 + - • -+0/6/) || 
approaches as / — > 00, then the collection (ej) is known as a Schauder basis for X. 

(3) Ultrafilter. Let X be a set. The set 

fS = {5„ G r{x): s^nsp / 0, Va / /?} c r{x) 

of all nonempty subsets of X with finite intersection property is known as a filter subbase on 
X and pB = {B <^ X : B = Oi^f^^ Si}, for / C D a finite subset of a directed set B, is a 
filter-base on X associated with the subbase pS; cf. Appendix Al. Then the filter generated by 
fS consisting of every superset of the finite intersections B £ pB of sets of pS is the smallest 
filter that contain the subbase pS and base pB. For notational simplicity, we will denote the 
subbase pS in the rest of this example simply by S. 

Consider the base-level ground set of all filter subbases on X 

6 = {S eV^{X): Pi 7^ / for every finite subset of S} C V^{X), 

ordered by inclusion in the sense that Sa ^ 5/3 for all a ^ /3 G O, and let the higher Hausdorff- 
level 

X = {(LeV^{X):(iisa chain in (6, C)} C V^iX) 
comprising the collection of the totally ordered chains 

= {{Sa}, {Sa, Sp}, ■■■ , {Sa, 5/3, • • • , 5J} G V^X) 

of S be 5f-built from the smallest {Sa} then an ultrafilter on X is a maximal member 5+ of 
(©, C) in the usual sense that any subbase 5 on X must necessarily be contained in 5+ so that 
5+ C cS =^ 5 = 5+ for any S C V{X) with FIP. The tower theorem now implies that the 
element 

^^ = {^a,<tp,--- ,C---}= Z^(^V\X) 

of P^(X), which is a chain in j£ of the chains of (5, is a (7-sequential fully ordered towered subset 
of the common elements of all sequential towers of (X, C) and a chained principal ultrafilter on 
('P^(X),C) generated by the filter-base {{{Sa}}} at {Sa}', here 
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is an obvious example of non-chained (7-tower whenever (£0-) is neither contained in, nor contains, 
any member of the G^Q-chain. Hausdorff 's chain theorem now yields the fixed-point <t^ G X 

sM^t) = = {{Sa}, 5^}, {Sa, S^, S^}, • • • } = e^Q V^X) 

as a maximal totally ordered ^-chained towered subset of X that is, by Zorn's lemma, a maximal 
element of the base level subset 6 ofV^{X). is a chained principal ultrafilter on ('P{X), C) 
generated by the filter-base {{Sa}} at Sa, while 5+ = {5q,, 5*^, S-y, • • • } S V'^{X) is an (non- 
principal) ultrafilter on X — characterized by the property that any collection of subsets on X 
with FIP (that is any filter subbase on X) must be contained the maximal set having FIP 
— that is not a principal filter unless Sa is a singleton set {xq}. ■ 

What emerges from these application of Zorn's Lemma is the remarkable fact that infinities 
(the dot-dot-dots) can be formally introduced as "limiting cases" of finite systems in a purely 
set-theoretic context without the need for topologies, metrics or convergences. The significance 
of this observation will become clear from our discussions on filters and topology leading to 
Sec. 4.2 below. Also, the observation on the successive iterates of the power sets 'P{X) in the 
examples above was to suggest their anticipated role in the complex evolution of a dynamical 
system that is expected to play a significant part in our future interpretation and understanding 
of this adaptive and self-organizing phenomenon of nature. 

End Tutorials 

From the examples in Tutorials, it should be clear that the sequential steps summarized in Fig. 
IIUI are involved in an application of Zorn's lemma to show that a partially ordered set has a 
maximal element with respect to its order. Thus for a partially ordered set (X, ^), form the 
set X of all chains C in X. If C+ is a maximal chain of X obtained by the Hausdorff Maximal 
Principle from the chain C of all chains of X, then its supremum u is a maximal element of 
(X, <). This sequence is now applied, paralleling Example 4.2(1), to the set of arbitrary relations 
Multi(X) on an infinite set X in order to formulate our definition of chaos that follows. 

Let / be a noninjective map in Multi(X) and P{f) the number of injective branches of /. 
Denote by 

F = {f £ Multi(X) : / is a noninjective function on X} C Multi(X) 

the resulting basic collection of noninjective functions in Multi(X). 

(i) For every a in some directed set B, let F have the extension property 

(V/«GF)(3/^GF):P(/,)<P(/^) 

(ii) Let a partial order ^ on Multi(X) be defined, for f^Jp G Map(X) C Multi(X) by 

P{fa) < P{fp) ^fa^ (54) 

with P{f) := 1 for the smallest /, define a partially ordered subset {F, ^) of Multi(X). This is 
actually a preorder on Multi(X) in which functions with the same number of injective branches 
are equivalent to each other. 

(iii) Let 

a = {/„ e Multi(X) : fa ^ /.} G V{F), ueB, 
be the ^-chains of non- injective functions of Multi(X) and 

X = {C e V{F) : C is a chain in (F, <)] C V{F) 
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denote the corresponding Hausdorff level of the chains of -F, with 



Ct = {C^,Cp,--- ,a,---}= ^TCViF) 

being a ^-sequential chain in X . By Hausdorff Maximal Principle, there is a maximal fixed-point 
5-towered chain £ X oi F 

sup(Ct) = C^ = {fa, fp, A, • • • } = 9{C^) G Ct C V{F). 

Ct 

Zorn's Lemma now applied to this maximal chain yields its supremum as the maximal element 
of C^, and thereby of F. It needs to be appreciated, as in the case of the algebraic Hamel 
basis, that the existence of this maximal non-functional element was obtained purely set theo- 
retically as the "limit" of a net of functions with increasing non-linearity, without resorting to 
any topological arguments. Because it is not a function, this supremum does not belong to the 
functional (7-towered chain having it as a fixed point, and this maximal chain does not possess 
a largest, or even a maximal, element, although it does have a supremum. The supremum is 
a contribution of the inverse functional relations {f^) in the following sense. From Eq. @, the 
net of increasingly non-injective functions of Eq. (|54)) implies a corresponding net of increasingly 
multivalued functions ordered inversely by the inverse relation fa ^ fp ^ fa ■ Thus the 

inverse relations which are as much an integral part of graphical convergence as are the direct 
relations, have a smallest element belonging to the multifunctional class. Clearly, this smallest 
element as the required supremum of the increasingly non-injective tower of functions defined 
by Eq. (|54|) . serves to complete the significance of the tower by capping it with a "boundary" 
element that can be taken to bridge the classes of functional and non- functional relations on X. 

We are now ready to define a maximally ill-posed problem f[x) = y for x,y £ X in terms of 
a maximally non-injective map f as follows. 

Definition 4.1. Chaotic map. Let A be a non-empty closed set of a compact Hausdorff 
space X. A fmiction f G Multi(X) {equivalently the sequence of functions (fi)) is maximally 
non-injective or chaotic on A with respect to the order relation (|54j) if 

(a) for any fi on A there exists an fj on A satisfying fi ^ fj for every j > i G N. 

(b) the setD^ consists of a countable collection of isolated singletons. □ 

Definition 4.2. Maximally ill-posed problem. Let A be a non-empty closed set of a 
compact Hausdorff space X and let f be a functional relation in Multi(X). The problem 
f{x) = y is maximally ill-posed at y if f is chaotic on A. □ 

As an example of the application of these definitions, on the dense set the tent map 
satisfi es both the con ditions of sensitive dependence on initial conditions and topological transi- 



tivity iDevaneiJ (|1989|) and is also maximally non-injective; the tent map is therefore chaotic on 
P+. In contrast, the examples of Sees. 1 and 2 are not chaotic as the maps are not topologically 
transitive, although the Liapunov exponents, as in the case of the tent map, are positive. Here 
the (fn) are identified with the iterates of /, and the "fixed point" as one through which graphs 
of all the functions on residual index subsets pass. When the set of points P+ is dense in [0, 1] 
and both and [0, 1] — 2?+ = [0, 1] — IJi^o /~*(P6r(/)) (where Per(/) denotes the set of peri- 
odic points of /) are totally disconnected, it is expected that at any point on this complement 
the behaviour of the limit will be similar to that on these points are special as they tie 
up the iterates on Per(/) to yield the multifunctions. Therefore in any neighbourhood C/ of a 
I?+-point, there is an xq at which the forward orbit {/*(a;o)}j>o is chaotic in the sense that 

(a) the sequence neither diverges nor does it converge in the image space of / to a periodic 
orbit of any period, and 

(b) the Liapunov exponent given by 

similar situation arises in the following more intuitive example. Although the subset A = 
{l/n}nez+ of the interval / = [—1,1] has no a smallest or minimal elements, it does have the infi- 
mum 0. Likewise, although A is bounded below by any clement of [—1,0), it has no greatest lower bound 
in [-1,0)U(0,1]. 
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which is a measure of the average slope of an orbit at xq or equivalently of the average loss of 
information of the position of a point after one iteration, is positive. Thus an orbit with positive 
Liapunov exponent is chaotic if it is not asymptotic (that is neither convergent nor adherent, 
having no convergent suborbit in the sense of Appendix Al) to an unstable periodic orbit or to 
any other limit set on which the dynamics is simple. A basic example of a chaotic orbit is that 
of an irrational in [0, 1] under the shift map and that of the chaotic set its closure, the full unit 
interval. 




Figure 12: Multifunctional and cobweb plots of 3.569a:(l — x). Comparison of the graphs for the three values of 
A shown in figures (a)-(f) illustrates how the dramatic changes in the character of the former are conspicuously 
absent in the conventional plots that display no perceptible distinction between the three cases. 

Let / G Map((X, Z//)) and suppose that A = {/-'(xo)}jgn is a sequential set corresponding to 
the orbit Orb(xo) = (/•' (2;o))jGN) and let /Ri(xo) = Uj>i /"'(^o) be the i-residual of the sequence 
(P(xo))jgN, with fBxo = {Miixo) : Res(N) X for all i G N} being the decreasingly nested 
filter-base associated with Orb(xo). The so-called uj-limit set of xq given by 

uj{xo) = {xGX:(3nfcEN)(nfc^oo) (/"fc(xo)^x)} /^^n 
= {xGX:{yNG A4)(V/m, G fB,,) (/m,(xo) (1^/0)} 

is simply the adherence set adh(/-^(xo)) of the sequence (/■'(xo))jeN) see Eq. hence Def. 

Al.ll of the filter-base associated with a sequence and Eqs. (|5S|) . H103|) and H1U6|) allow 

us to express ti;(xo) more meaningfully as 

o.(xo) = nCl(/M.(xo)). (56) 

It is clear from the second of Eqs. (|55() that for a continuous / and any x £ X , x £ uj{xq) implies 
f{x) G a;(xo) so that the entire orbit of x lies in a;(xo) whenever x does implying that the w-limit 
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set is positively invariant; it is also closed because the adherent set is a closed set according 
to Theorem A1.3. Hence xq G u>{xq) =^ A CI uj{xq) reduces the w-limit set to the closure of A 
without any isolated points, A C Der(y4). In terms of Eq. (jlOSf) involving principal filters, Eq. 

in this case may be expressed in the more transparent form u>{xo) = P| Cl( f'P({/'' {xo)}JLq)) 
where the principal filter f'P({/'' (3^o)}^o) ^ consists of all supersets of ^ = {f-^{xo)}j%Q, 
and uj{xq) represents the adherence set of the principal filter at A, see the discussion following 
Theorem Al.3. If A r epresents a cha o tic or bit under this condition, then uj{xq) is sometimes 
known as a chaotic .se nAlligood et al. thus the chaotic orbit infinitely often visits every 

member of its chaotic set^^ which is simply the w-limit set of a chaotic orbit that is itself 
contained in its own limit set. Clearly the chaotic set if positive invariant, and from Thm. Al.3 
and its corollary it is also compact. Furthermore, if all (sub)sequences emanating from points 
xn in some neighbourhood of the set converge to it, then u!{xo) is called a chaotic attractor, see 
Alligood. Sauer and York3 ( 1997 ) . As common examples of chaotic sets that are not attractors 



mention may be made of the tent map with a peak value larger than 1 at 0.5, and the logistic 
map with A > 4 again with a peak value at 0.5 exceeding 1. 




It is important that the difference in the dynamical behaviour of the system on P+ and 
its complement be appreciated. At any fixed point x of /* in (or at its equivalent images 
in [x]) the dynamics eventually gets attached to the (equivalent) fixed point, and the sequence 
of iterates converges graphically in Multi(X) to x (or its equivalent points). When x ^ 2?+, 
however, the orbit A = {/*(x)}igN is chaotic in the sense that {P{x)) is not asymptotically 
periodic and not being attached to any particular point they wander about in the closed chaotic 
set uj{x) = Der(A) containing A such that for any given point in the set, some subsequence of 
the chaotic orbit gets arbitrarily close to it. Such sequences do not converge anywhere but only 
frequent every point of Der(A). Thus although in the limit of progressively larger iterations 
there is complete uncertainty of the outcome of an experiment conducted at either of these two 
categories of initial points, whereas on this is due to a random choice from a multifunctional 
set of equally probable outputs as dictated by the specific conditions under which the experiment 
was conducted at that instant, on its complement the uncertainty is due to the chaotic behaviour 
of the functional iterates themselves. Nevertheless it must be clearly understood that this 

^^How does this happen for A = {/'(a:o)}iGN that is not the constant sequence (xq) at a fixed point? 
As i e N increases, points are added to {xq, /{xq), • • • , f^{xo)} not, as would be the case in a normal 
sequence, as a piled-up Cauchy tail, but as points generally lying between those already present; recall 
a typical graph as of Fig. El for example. 



49 




later behaviour is entirely due to the multifunctional limits at the points which completely 
determine the behaviour of the system on its complement. As an explicit illustration of this 
situation, recall that for the shift map 2x mod(l) the P+ points are the rationals on [0, 1], and 
any irrational is represented by a non-terminating and non-repeating decimal so that almost all 
decimals in [0, 1] in any base contain all possible sequences of any number of digits. For the 
logistic map, the situation is more complex, however. Here the onset of chaos marking the end 
of the period doubling sequence at A* = 3.5699456 is signaled by the disappearance of all stable 
fixed points, Fig. I12r c). with Fig. I12r a) being a demonstration of the stable limits for A = 3.569 
that show up as convergence of the iterates to constant valued functions (rather than as constant 
valued inverse functions) at stable fixed points, shown more emphatically in Fi jl3r a). What 
actually happens at A* is shown in Fig. I16f a) in the next subsection: the almost vertical lines 
produced at a large, but finite, iterations i (the multifunctions are generated only in the limiting 
sense of z — > oo and represent a boundary between functional and non-functional relations on 
a set), decrease in magnitude with increasing iterations until they reduce to points. This gives 
rise to a (totally disconnected) Cantor set on the y-axis in contrast with the connected intervals 
that the multifunctional limits at A > A* of Figs. ll6f bVfd) produce. By our characterization 
Definition 4.1 of chaos therefore, Ax(l — x) is chaotic for the values of A > A* that are shown in 
Fig. 1161 We return to this case in the following subsection. 

As an example of chaos in a noniterative system, we investigate the following question: 
While maximality of non-injectiveness produced by an increasing number of injective branches 
is necessary for a family of functions to be chaotic, is this also sufficient for the system to be 
chaotic? This is an important question especially in the context of a non-iterative family of 
functions where fixed points are of no longer relevant. 

Consider the sequence of functions | sin(7rnx)|J^]^. The graphs of the subsequence | sin(2"~^7r2;)| 
and of the sequence (t"'(x)) on [0,1] are qualitatively similar in that they both contain 2""'^ of 
their functional graphs each on a base of 1/2"'"^. Thus both | sin(2"'~^7rx)|J^;^ and (t"(x))J^]^ 
converge graphically to the multifunction [0,1] on the same set of points equivalent to 0. This is 
sufficient for us to conclude that | sin(2"~^7rx)|,^;^, and hence | sin(7rnx)|^;^, is chaotic on the 
infinite equivalent set [0]. While Fig. El was a comparison of the first four iterates of the tent 
and absolute sine maps. Fig. El following shows the "converged" graphical limits for after 17 
iterations. 

4-1- The chaotic attractor 
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Figure 13: Fixed points and cycles of logistic map. The isolated fixed point of figure (b) yields two non- 
fixed points to which the iterates converge simultaneously in the sense that the generated sequence converges to 
one iff it converges to the other. This suggests that nonlinear dynamics of a system can lead to a situation in 
which sequences in a Hausdorff space may converge to more than one point. Since convergence depends on the 
topology (Corollary to Theorem A1.5), this may be interpreted to mean that nonlinearity tends to modify the 
basic structure of a space. 





Stable 8-cycle, A 
Graphical limit at 



3.55 
9008 




1 



(d) 



0.7 



Figure 1131 contd: Isolated fixed points of logistic map. The sequence of points generated by the iterates of the 
map are marked on the y-axis of (a)-(c) in italics. The singletons {x} are u-limit sets of the respective fixed 
point X and is generated by the constant sequence {x, x,- ■ ■). Whereas in (a) this is the limit of every point in 
(0, 1), in the other cases these fixed points are isolated in the sense of Def. 2.3. The isolated points, however, give 
rise to sequences that converge to more than one point in the form of limit cycles as shown in figures (b)-(d) . 
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17th iterate of tent map .0008 Graph of |sin(2^'57r2;)| .0008 

(a) (b) 

Figure 14: Similarity in the behaviour of the graphs of the (a) tent and (b)| sin(2^''7ra;) | maps at 17 iterations 
demonstrate chaoticity of the later. 

One of the most fascinating characteristics of chaos in dynamical systems is the appearance of 
attractors the dynamics on which are chaotic. For a subset A of a topological space {X,U) 
such that TZ{f{A)) is contained in A — in this section, unless otherwise stated to the contrary, 
f{A) will denote the graph and not the range (image) of f — which ensures that the iteration 
process can be carried out in A, let 

MM) = U /'(^) 

j>ieN 

= U fu/^(-) 

generate the filter-base with Ai := /iRi(A) G fB being decreasingly nested, Ai^i C Ai for all 
i £ N, in accordance with Def. Al.l. The existence of a maximal chain with a corresponding 
maximal element as asssured by the Hausdorff Maximal Principle and Zorn's Lemma respec- 
tively implies a nonempty core of pB. As in Sec. 3 following Def. 3.3, we now identify the 
filterbase with the neighbourhood base at f°° which allows us to define 

Atr(^i) =^ adh(F-B) 

as the attractor of the set Ai, where the last equality follows from Eas. ((5n|) and (^0)) and the 
closure is with respect to the topology induced by the neighbourhood filter base -pB. Clearly the 
attractor as defined here is the graphical limit of the sequence of functions (/*)ieN which may be 
verified by reference to Def. Al.8, Thm. A1.3 and the proofs of Thms. A1.4 and A1.5, together 
with the directed set Eq. with direction The basin of attraction of the attractor is 

Ai because the graphical limit (D+, |J(G(7^+), 7?.+) of Def. 3.1 may be obtained, as 

indicated above, by a proper choice of sequences associated with A. Note that in the context 
of iterations of functions, the graphical limit (I?+,yo) of the sequence (/"(x)) denotes a stable 
fixed point a;* with image x* = f{x^) = yo to which iterations starting at any point x £ P+ 
converge. The graphical limits {xio,TZ+) are generated with respect to the class {xj*} of points 
satisfying /(xjo) = Xi^^, i = 0, 1,2,--- equivalent to unstable fixed point to which 



(5Yj 
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inverse iterations starting at any initial point in 7^+ must converge. Even though only is 
inverse stable, an equivalent class of graphically converged limit multis is produced at every 
member of the class Xj* G [x,,], resulting in the far-reaching consequence that every member of 
the class is as significant as the parent fixed point x* from which they were born in determining 
the dynamics of the evolving system. The point to remember about infinite intersections of a 
collection of sets having finite intersection property, as in Eq. (|58)) , is that this may very well be 
empty; recall, however, that in a compact space this is guaranteed not to be so. In the general 
case, if core(^) / then A is the principal filter at this core, and Atr(Ai) by Eqs. (|58|) and (|in5|) 
is the closure of this core, which in this case of the topology being induced by the filterbase, 
is just the core itself. Ai by its very definition, is a positively invariant set as any sequence of 
graphs converging to Atr(yli) must be eventually in Ai: the entire sequence therefore lies in 
Ai. Clearly, from Thm. A3.1 and its corollary, the attractor is a positively invariant compact 
set. A typical attractor is illustrated by the derived sets in the second column of Fig. 1221 which 
also illustrates that the set of functional relations are open in Multi(X); specifically functional- 
nonfunctional correspondences are neutral-selfish related as in Fig. I22L 3-2, with the attracting 
graphical limit of Eq. ()58() forming the boundary of (finitely)many-to-one functions and the 
one-to- (finitely) many multifunctions. 

Equation ()58|) is to be compared with the image definition of an attractor Stuart and Humphries! 



where f{A) denotes the range and not the graph of /. Then Eq. (|58j) can be used to 
define a sequence of points S An^. and hence the subset 

u;{A) = {xGX:(3nfcGN)(nfc^cx))(3xfcG^„J (rn^fc)^^)} 

= {x£X:{yNeM,){yA£A)iNf]Ai^^)} (59) 

as the corresponding attractor of A that satisfies an equation formally similar to ()58() with 
the difference that the filter-base A is now in terms of the image f{A) of A, which allows the 
adherence expression to take the particularly simple form 

cviA) = n Clif\A)). (60) 

The complimentary subset excluded from this definition of uj{A), as compared to Atr(yli), that 
is required to complete the formalism is given by Eq. (|61() below. Observe that the equation 
for uj{A) is essentially Eq. (|87|) . even though we prefer to use the alternate form of Eq. I|88|) as 
this brings out more clearly the frequenting nature of the sequence. The basin of attraction 

Bf{A) = {x e A: Lo{x) C Atv{A)} 

= {x e A: {3nk en) {uk ^ oo) if ""f^ix) ^ X* e oj{ A) ) ^ ' 

of the attractor is the smallest subset of X in which sequences generated by / must eventually 
lie in order to adhere at uj{A). Comparison of Eqs. (|62() with (|33() and ()61() with ()32p show that 
CO (A) can be identified with the subset 7^+ on the y-axis on which the multifunctional limits 
G : X oi graphical convergence are generated, with its basin of attraction being contained 

in the T>-^ associated with the injective branch of / that generates In summary it may 

be concluded that since definitions and (|HT|) involve both the domain and range of /, a 
description of the attractor in terms of the graph of /, like that of Eq. (|58|) . is more pertinent 
and meaningful as it combines the requirements of both these equations. Thus, for example, as 
u;{A) is not the function G{TZ+), this attractor does not include the equivalence class of inverse 
stable points that may be associated with for example Fig. 1151 

Prom Eq. H59() . we may make the particularly simple choice of (x^) to satisfy /"'''(x_fc) = x 
so that x_fc = /g"''(x), where x_fc G [x-k] ■= f~^^{x) is the element of the equivalence class 
of the inverse image of x corresponding to the injective branch /b. This choice is of special 
interest to us as it is the class that generates the G- function on in graphical convergence. 
This allows us to express uj{A) as 

uj{A) = {xe X: {3nk G N)(nfc oo)(/b "'''(x) = x_fc converges in {X,U))}; (62) 
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note that the x_fc of this equation and the of Eq. H59|) are, in general, quite different points. 

A simple illustrative example of the construction of io{A) for the positive injective branch 
of the homeomorphism (4x^ — l)/3, —1 < x < 1, is shown in Fig. 1151 where the arrow-heads 
denote the converging sequences /""'(xj) x and /"'"'"(xj) — > X—fYi which proves invariance of 
CO (A) for a homeomorphic /; here continuity of the function and its inverse is explicitly required 
for invariance. Positive invariance of a subset A oi X implies that for any n E N and x £ A, 
/"(x) = i/n & A, while negative invariance assures that for any y £ A, f~"'{y) = x_„ E A. 
Invariance of A in both the forward and backward directions therefore means that for any 
y G A and n E N, there exists a x £ A such that /"(x) = y. In interpreting this figure, it 
may be useful to recall from Def. 4.1 that an increasing number of injective branches of / is 
a necessary, but not sufficient, condition for the occurrence of chaos; thus in Figs. llHf a) and 
1151 increasing noninjectivity of / leads to constant valued limit functions over a connected P+ 
in a manner similar to that associated with the classical Gibb's phenomenon in the theory of 
Fourier series. 



1 




X_i X_3 

Figure 15: The attractor for f(x) = {4x^ — l)/3, for —1 < x < 1. The converging sequences are denoted 
by arrows on the right, and (xk) are chosen according to the construction shown. This example demonstrates 
how ahhough A C f{A), where A = [0, 1] is the domain of the positive injective branch of /, the succeeding 
images (/'(^))i>i satisfy the required restriction for iteration, and A in the discussion above can be taken to be 
f{A); this is permitted as only a finite number of iterates is thereby discarded. It is straightforward to verify 
that Atr(Ai) = (-1, [-0.25, 1]) U((-l, 1). -0.25) U(l, [-0.25, 1]) with F{x) = -0.25 on V- = (-1, 1) = V+ and 
G{y) = 1, and — 1 on TZ- — [—0.25, 1] = TZ+. By comparison, u}{A) from either its definition Eq. H59^ or from 
the equivalent intersection expression Eq. 16011 . is simply the closed interval TZ+ — [—0.25, 1]. The italicized 
iterate numbers on the graphs show how the oscillations die out with increasing iterations from a: = ±1 and 
increasingly approach —0.25 in all neighbourhoods of 0. 

Graphical convergence of an increasingly nonlinear family of functions implied by its in- 
creasing non-injectivity may now be combined with the requirements of an attractor to lead to 
the concept of a chaotic attractor to be that on which the dynamics is chaotic in the sense of 
Defs. 4.1. and 4.2. Hence 

Definition 4.3. Chaotic Attractor. Let A be a positively invariant subset of X . The attractor 
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Atr(^) is chaotic on A if there is sensitive dependence on initial conditions for all x £ A. The 
sensitive dependence manifests itself as multifunctional graphical limits for all x G and as 
chaotic orbits when x D^. □ 




Figure 16: Chaotic attractors for A = A, and A = 3.575. For the logistic map the usual bifurcation diagram (e) 
shows the chaotic attractors for A > A, = 3.5699456, while (a) — (d) display the graphical limits for four values 
of A chosen for the Cantor set and 4,- 2-, and 1-piece attractors respectively. In (f) the attractor [0, 1] (where 
the dotted lines represent odd iterates and the solid lines even iterates of /) disappear if / is reflected about the 
a;-axis. The function ft{x) is given by 

f(^^-j 2(l + a;)/3, 0<x<l/2 
/fW <y 2(1-2;), l/2<s<l 

The picture of chaotic attractors that emerge from the foregoing discussions and our char- 
acterization of chaos of Def. 4.1 is that it it is a subset of X that is simultaneously "spiked" 
multifunctional on the y-axis and consists of a dense collection of singleton domains of attrac- 
tion on the X-axis. This is illustrated in Figure IT^ which shows some typical chaotic attractors. 
The first four diagrams (a) — (d) are for the logistic map with (b) — (d) showing the 4-, 2- and 
1-piece attractors for A = 3.575, 3.66, and 3.8 respectively that are in qualitative agreement 
with the standard bifurcation diagram reproduced in (e). Figs, (b) — (d) have the advantage of 
clearly demonstrating how the attractors are formed by considering the graphically converged 
limit as the object of study unlike in Fig. (e) which shows the values of the 501-lOOlth iterates 
of xq = 1/2 as a function of A. The difference in Figs, (a) and (b) for a change of A from 
A > A* = 3.5699456 to 3.575 is significant as A = A* marks the boundary between the nonchaotic 
region for A < A* and the chaotic for A > A* (this is to be understood as being suitably modified 
by the appearance of the nonchaotic windows for some specific intervals in A > A*). At A* the 
generated fractal Cantor set A is an attractor as it attracts almost every initial point xq so that 
the successive images x" = /"(xq) converge toward the Cantor set A. In Fig. (f) the chaotic 
attractors for the piecewise continuous function on [0, 1] 



2(l + x)/3, 0<x<l/2 
2(1 -x), l/2<x<l, 



is [0, 1] where the dotted lines represent odd iterates and the full lines even iterates of /; here 
the attractor disappears if the function is reflected about the x-axis. 
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Figure [TBI contd. Chaotic attractors for A — 3.66 and A = 3.8. 

4-2. Why Chaos? A Preliminary Inquiry 

The question as to why a natural system should evolve chaotically is both interesting and rel- 
evant, and this section attempts to advance a plausible answer to this inquiry that is based on 
the connection between topology and convergence contained in the Corollary to Theorem Al.5. 
Open sets are groupings of elements that govern convergence of nets and filters, because the 
required property of being either eventually of frequently in (open) neighbourhoods of a point 
determines the eventual behaviour of the net; recall in this connection the unusual convergence 
characteristics in cofinite and cocountable spaces. Conversely for a given convergence character- 
istic of a class of nets, it is possible to infer the topology of the space that is responsible for this 
convergence, and it is this point of view that we adopt here to investigate the question of this 
subsection: recall that our Definitions 4.1 and 4.2 were based on purely algebraic set-theoretic 
arguments on ordered sets, just as the role of the choice of an appropriate problem-dependent 
basis was highlighted at the end of Sec. 2. 
Chaos as manifest in its attractors is a direct consequence of the increasing nonlinearity of 
the map with increasing iteration; we reemphasize that this is only a necessary condition so 
that the increasing nonlinearities of Figs. \1'A\ and IT^ eventuallv lead to stable states and not to 
chaotic instability. Under the right conditions as enunciated following Fig. IIUI chaos appears 
to be the natural outcome of the difference in the behaviour of a function / and its inverse 
/~ under their successive applications. Thus / = ff~f allows / to take advantage of its 
multi inverse to generate all possible equivalence classes that is available to it, a feature not 
accessible to /~ = f~ff~- As we have seen in the foregoing, equivalence classes of fixed points, 
stable and unstable, are of defining significance in determining the ultimate behaviour of an 
evolving dynamical system and as the eventual (as also frequent) character of a filter or net in 
a set is dictated by open neighbourhoods of points of the set, it is postulated that chaoticity on 
a set X leads to a reformulation of the open sets of X to equivalence classes generated by the 
evolving map f , see Example 2.4(3). Such a redefinition of open sets of equivalence classes allow 
the evolving system to temporally access an ever increasing number of states even though the 
equivalent fixed points are not fixed under iterations of / except for the parent of the class, and 
can be considered to be the governing criterion for the cooperative or collective behaviour of the 
system. The predominance of the role of /~ in / = ff^f in generating the equivalence classes 
(that is exploiting the many-to-one character) of / is reflected as limit multis for / (that is 
constant /~ on in /~ = f~ff~; this interpretation of the dynamics of chaos is meaningful 
as graphical convergence leading to chaos is a result of pointwise biconvergence of the sequence 
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of iterates of the functions generated by /. But as / is a noninjective function on X possessing 
the property of increasing nonlinearity in the form of increasing noninjectivity with iteration, 
various cycles of disjoint equivalence classes are generated under iteration, see for example Fig. 
ini^a) for the tent map. A reference to Fig. El shows that the basic set Xb, for a finite number n 
of iterations of /, contains the parent of each of these open equivalent sets in the domain of /, 
with the topology on X^ being the corresponding p-images of these disjoint saturated open sets 
of the domain. In the limit of infinite iterations of / leading to the multifunction M (this is 
the of Sec. 4.1), the generated open sets constitute a basis for a topology on 'D{f) and the 
basis for the topology of TZ^f) are the corresponding A^-images of these equivalent classes. It is 
our contention that the motive force behind evolution toward a chaos, as defined by Def. ^.i, is 
the drive toward a state of the dynamical system that supports ininality of the limit multi Ai; 
see Appendix A2 with the discussions on Fig. Inland Eq. H26() in Sec. 2. In the limit of infinite 
iterations therefore, the open sets of the range C X are the multi images that graphical 

convergence generates at each of these inverse-stable fixed points. X therefore has two topologies 
imposed on it by the dynamics of /: the first of equivalence classes generated by the limit multi 
M in the domain of / and the second as A^-images of these classes in the range of /. Quite 
clearly these two topologies need not be the same; their intersection therefore can be defined 
to be the chaotic topology on X associated with the chaotic map f on X. Neighbourhoods 
of points in this topology cannot be arbitrarily small as they consist of all members of the 
equivalence class to which any element belongs; hence a sequence converging to any of these 
elements necessarily converges to all of them, and the eventual objective of chaotic dynamics is 
to generate a topology in X with respect to which elements of the set can be grouped together 
in as large equivalence classes as possible in the sense that if a net converges simultaneously to 
points X ^ y £ X then x ^ y: x is of course equivalent to itself while x, y, z are equivalent to 
each other iff they are simultaneously in every open set in which the net may eventually belong. 
This hall-mark of chaos can be appreciated in terms of a necessary obliteration of any separation 
property that the space might have originally possessed, see property (113) in Appendix A3. We 
reemphasize that a set in this chaotic context is required to act in a dual capacity depending 
on whether it carries the initial or final topology under M. 

This preliminary inquiry into the nature of chaos is concluded in the final section of this 
work. 
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5. Graphical convergence works 

We present in this section some real evidence in support of our hypothesis of graphical conver- 
gence of functions in Multi(X, y). The examp le is taken from neutron transport theory, and 
concerns the discretized spectral approximation Senguptal ( 19881 . 199R|) of Case's singula r eigen - 
function solution of the monoenergetic neutron transport equation, ICase and Zweife] ( 1967l l. 
The neutron transport equation is a linear form of the Boltzmann equation that is obtained as 
follows. Consider the neutron- moderator system as a mixture of two species of gases each of 
which satisfies a Boltzmann equation of the type 

d \ 

dv' j dvi j dv[^Wij{vi ^ v']Vi ^ v[){fi{r,v' ,t)fj{r,v[,t) fi{r,v,t)fj{r,vi,t)}) 

j 

where 

Wij{vi v';vi v[) =\v -vi\ Oij{v - v',vi - v[) 

Gij being the cross-section of interaction between species i and j. Denote neutrons by subscript 1 
and the background moderator with which the neutrons interact by 2, and make the assumptions 
that 

(i) The neutron density /i is much less compared with that of the moderator /2 so that the 
terms /i/i and /1/2 may be neglected in the neutron and moderator equations respectively. 

(ii) The moderator distribution /2 is not affected by the neutrons. This decouples the 
neutron and moderator equations and leads to an equilibrium Maxwellian /m for the moderator 
while the neutrons are described by the linear equation 

d \ 



dv' j dvi j dv[Wu{v ^ v';v, ^ v[){f{r,v',t)fM{v[) - -f{r,v,t)fM{vi)}) 
This is now put in the standard form of the neutron transport equation IWilliamj (|l97lh 

(^^^ + n.v + S{E)j<^{r,E,h,t) = j dn' j dE'S{r,E' E;h' ■ h) <^{r, E' ,n' ,t). 

where E = mv'^ /2 is the energy and Vt the direction of motion of the neutrons. The steady 
state, monoenergetic form of this equation is Eq. ()125() 



d^{x,fi) 



+ <^{x, /x) = - / iJ,')dn', 0<c<l, -l<fi<l 



dx 

and its singular eigenfunction solution for x G (—00, 00) is given by Eq. H128() 



see Appendix A4 for an introductory review of Case's solution of the one-speed neutron transport 
equation. 

The term "eigenfunction" is motivated by the following considerations. Consider the eigen- 
value equation 

= 0, fi€V{n),u£R (63) 

in the space of multifunctions Multi(y(/x), (—00, 00)), where /u is in either of the intervals [—1, 1] 
or [0, 1] depending on whether the given boundary conditions for Eq. (|125j) is full-range or half 
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n = x 


cl(7^) = X 


Cl(7^) ^ X 


Not injective 








Pa{Ji) 


Injective 


Not contiuous 




Ca{ii) 


Ra{L) 


Continuous 




p{L) 





Table 1: Spectrum of linear operator £ e Map(X). Here £j\ := L — X satisfies the equation Lx{x) = 0, 
with the resolvent set p{£j) of £ consisting of all those complex numbers A for which exists as a 
continuous operator with dense domain. Any value of A for which this is not true is in the spectrum a{£j) 
of that is further subdivided into three disjoint components of the point, continuous, and residual 
spectra according to the criteria shown in the table. 



range. If we are looking only for functional solutions of Eq. ()63() . then the unique function 
that satisfies this equation for all possible /Li G ^(m) ^iid G M — V{n) is = which 

means, according to Tabled that the point spectrum of /i is empty and {fj, — exists for 
all u. When v G Vip), however, this inverse is not continuous and we show below that in 
Map(y(;u), 0), u G V{fi) belongs to the continuous spectrum of fj,. This distinction between 
the nature of the inverses depending on the relative values of /i and ly suggests a wider "non- 
function" space in which to look for the solutions of operator equations, and in keeping with 
the philosophy embodied in Fig. El of treating inverse problems in the space of multifunctions, 
we consider all 'Ju G Multi(y(/x), M)) satisfying Eq. to be eigenfunctions of fi for the 

corresponding eigenvalue ly, leading to the following multifunctional solution of (|63jl 



where V{fi) — is used as a shorthand for the interval V{fi) with ly deleted. Rewriting the 
eigenvalue equation (|6.S|) as iJ,^{S'i^{fi)) = and comparing this with Fig. [3 allows us to draw 
the correspondences 

/ 1^1^ 

X and y ^ {% G Multi(l/(/i), M) : % G V{fi,,)} 

f{X) ^ {0:OGy} (64) 
Xb ^ {0:OgX} 

Thus a multifunction in X is equivalent to in X-q under the linear map and we show 
below that this multifunction is infact the Dirac delta "function" S^ilJ-), usually written as 
(5(/x — v). This suggests that in Multi(l/(//), M), every v G Vi^^i) is in the point spectrum of fi, 
so that discontinuous functions that are pointwise limits of functions in function space can be 
replaced by graphically converged multifunctions in the space of multifunctions. Completing the 
equivalence class of in Fig. [51 gives the multifunctional solution of Eq. H63() . 

From a comparison of the definition of ill-posedness (Sec. 2) and the spectrum (Table it 
is clear that ^i\{x) = y is ill-posed iff 

(1) XLa not injective X £ Pa{Jix), which corresponds to the first row of Tabled 

(2) £a not surjective 4^ the values of A correspond to the second and third columns of Table 

□ 

(3) Lx is bijective but not open A is either in Ca{iix) or Ra{Lx) corresponding to the 
second row of Table ^ 
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We verify in the three steps below that X = 1, 1] of integrable functions, v G y{^^) = 
[— 1, 1] belongs to the continuous spectrum of ^i. 

(a) TZ{^u) is dense, but not equal to Li. The set of functions g{fj,) £ Li such that ^~^g G Li 
cannot be the whole of Li. Thus, for example, the piecewise constant function g = const 7^ 
on|// — i/|<5>0 and otherwise is in Li but not in TZ{iii,) as Li. Nevertheless for 

any g £ Li, we may choose the sequence of functions 




if l/U — z^|<l/n 
otherwise 



in TZ{fj,u) to be eventually in every neighbourhood of g in the sense that lim„^oo f^i \ g—dn |= 0. 

(b) The inverse (/i — exists hut is not continuous. The inverse exists because, as noted 
earlier, is the only functional solution of Eq. . Nevertheless although the net of functions 

^"'^^^ " ts.n-\l + u)/e + ts.n-\l-v)/e ^{^i - uf + ' ^ ^ ° 
is in the domain of fi^ because 5ue{p)dn = 1 for all e > 0, 

lim / \ n — V \ 5ue{lj)d^ = 

implying that — is unbounded. 

Taken together, (a) and (b) show that functional solutions of Eq. (|63j) lead to state 2-2 in 
Tabled hence v G [—1, 1] = Ca{fi). 

(c) The two integral constraints in (b) also mean that z/ G Ca{fj,) is a generalized eigen- 

value of /.i which justifies calling the graphical limit 6ue{fJ^) ^u{f^) a generalized, or singular, 
eigenf unction, see Fig. El which clearly indicates the convergence of the net of functions^^. 

From the fact that the solution Eq. (|128() of the transport equation contains an integral 
involving the multifunction (/>(^, i'), we may draw an interesting physical interpretation. As the 
multi appears every where on V{fj,) (that is there are no chaotic orbits but only the multifunc- 
tions that produce them) , we have here a situation typical of maximal ill-posedness characteristic 
of chaos: note that both the functions comprising are non-injective. As the solution 

(|128|) involves an integral over all G V{fi), the singular eigenfunctions — that collectively may 
be regarded as representing a chaotic substate of the system represented by the solution of the 
neutron transport equation — combine with the functional components (j){zizi'Q, fi) to produce 
the well-defined, non-chaotic, experimental end result of the neutron flux <I>(x,//). 

The solution (|128() is obtained by assuming <l>(x,//) = e~^/'^(/)(/i, i/) to get the equation 
for (f>{fi,u) to be {fx — v)<t){^,v) = —cu/2 with the normalization J^-^<f){^,v) = 1. As /i""*^ 
is not invertible in Multi(y(;u), M) and u,m '■ t[-^^ d oes not exist, the alternate ap- 

proach of regularization was adopted in lSenguptal ( 198^ . 19951 ) to rewrite v) = — ci//2 as 



IJ'i'e't'eil^y ^) = — ci^/2 with fi^i: := ^ — {u + ie) being a net of bijective functions for e > 0; this 
is a consequence of the fact that for the multiplication operator every nonreal A belo ngs to the 
resolvent set of the operator. The family of solutions of the later equation is given by Senguptal 

(i2si,E22i) 

where the required normalization (p^{i',fj,) = 1 gives 

^''The technical definition of a generalized eigenvalue is as follows. Let £ be a linear operator such that 
there exists in the domain of £ a sequence of elements (x„) with ||xn|| = 1 for all n. If lini„^oo !!(£ — 
A)a;„|| = for some A G C, then this A is a generalized eigenvalue of C, the corresponding eigcnfunction 
Xoc being a generalized eigenf unction. 
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Figure 17: Graphical convergence of: (a) Poisson kernel S^{x) ~ e/TT{x^ + and (b) conjugate Poisson 
kernel Pe{x) = x/{x'^ + e^) to the Dirac delta and principal value respectively; the graphs, each for a 
definite e value, converges to the respective limits as e — *■ 0. 

"^""^ tan-i(l + i/)/e + tan-i(l-i^)/e V 4 {1 - u)^ + 

with ^ 

vr, = .|_^-^ = 2tan-i (l) 

These discretized equations should be compared with the corresponding exact ones of Appendix 
A4. We shall see that the net of functions (|65j) converges graphically to the multifunction Eq. 
(IT^ as e ^ 0. 

In the discretized spectral approximation., the singular eigenfunction (/)(/u,z/) is replaced by 
(psiPi £ ^ 0, with the integral in v being replaced by an appropriate sum. The solution Eg . 
^(11301) of the physically interesting half-space x > problem then reduces to ISenguptal (|l988l . 

^ AT 

^e{x, fi) = a(z/o)e-"/^»0(^i, i/o) + J2 «(^*)e""/''</'e(^*, i^i) M e [0, 1] (66) 

1=1 

where the nodes {fil^^^ are chosen suitably. This discretized spectral approximation to Case's 
solution has given surprisingly accurate numerical results for a set of properly chosen nodes when 
compared with exact calculations. Because of its involved nature Case and Zweifel ( 1967 '). the 
exact calculations are basically numerical which leads to nonlinear integral equations as part of 
the solution procedure. To appreciate the enormous complexity of the exact treatment of the 
half-space problem, we recall that the complete set of eigenfunctions {4>{fJ;i^o),{4>ilJ'i'^)}iy£[o,i]} 
are orthogonal with respect to the half-range weight function W{fi) of half-range theory, Eq. 
(|133j) . that is expressed only in terms of solution of the nonlinear integral equation Eq. (|134|) . 
The solution of a half-space problem then evaluates the coefficients {a(z^o)) ^i'^)i/e[o,i]} from the 
appropriate half range (that is < < 1) orthogonality integrals satisfied by the eigenfunctions 
{0(//, vq), {0(/U, i')},yg[o,i]} with respect to the weight W{fi), see Appendix A4 for the necessary 
details of the half-space problem in neutron transport theory. 

As may be appreciated from this brief introduction, solutions to half-space problems are 
not simple and actual numerical computations must rely a great deal on tabulated values of 
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the X-function. Self-consistent calculations of sample benchmark problems performed by the 
discretized spectral approximation in a full-range adaption of the half-range problem described 
below that generate all necessary data, independent of numerical tables, with the quadrature 
nodes {ui}^^ taker i at the ze r os Legendr e polynomials show that the full range formulation of 



nodes jfj j^^j^ taker i at tne ze r os .Legendr e polynomials snow tnat tne lull range lormulation oi 
this approximation Senguotal (|l988l . Tl995l 'l can give very accurate results not only of integrated 



quantities like the flux <I> and leakage of particles out of the half space, but of also basic "raw" 
data like the extrapolated end point 

and of the X-function itself. Given the involved nature of the exact theory, it is our contention 
that the remarkable accuracy of these basic data, some of which is reproduced in Table [2 is 
due to the graphical convergence of the net of functions 

(l)e{n,iy) — > (t^ifJ',^) 

shown in Fig. 1181 here e = l/irN so that e — > as X — > oo. By this convergence, the delta func- 
tion and principal values in [—1, 1] are the multifunctions ([—1, 0), 0) \J{0, [0, oo) I j((0, 1], 0) and 
{l/x}j,g[_i 0) UlO) (~°°' °°)) U{l/^}a;e(o,i] respectively. Tables |21 and 01 taken from ISenguptal 




Figure 18: Rational function approximations (f>eifJ-,v) of the singular eigenfunction <f){^,v) at four 
different values oi v. N = 1000 denotes the "converged" multifunction (j>, with the peaks at the specific 
j^-values chosen. 

(|l988h a,nd ISenguntl (|l99,'j l. show respectively the extrapolated end point and X-function by 
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the full-range adaption of the discretized spectral approximation for two different half range 
problems denoted as Problems A and B defined as 



Problem A Equation: fi^^ + ^ = (c/2) /j^^ ^{x,^')dfi', x>0 

Boundary condition : ^{0, fi) = 0, fi > 

Asymptotic condition : <I> — > e~^^'^'^4'{fi, uq), x — > oo. 
Problem B Equation: ^t^^. + = (c/2) /^^ ^{x, fi')dfi' , x>0 

Boundary condition : <I>(0, /i) = 1, /i > 

Asymptotic condition : $ — > 0, x — > oo. 

The full — 1 < < 1 range form of the half < fi < 1 range discretized spectral approximation 
replaces the exact integral boundary condition at x = by a suitable quadrature sum over the 
values of taken at the zeros of Legendre polynomials; thus the condition at x = can be 
expressed as 

N 

i'ifJ') = a{i^o)Hf^,'^o) + ^a{ui)(/)e{fi,iJi), ^g[0, 1], (68) 

1=1 

where = ^{0, fi) is the specified incoming radiation incident on the boundary from the left, 
and the half-range coefficients 0(1^0)5 {fl(^)}i/e[o,i] to be evaluated using the M^-function of 
Appendix 4. We now exploit the relative simplicity of the full-range calculations by replacing Eq. 
(|68|) by Eq. (|69|) following, where the coefficients {6(i^i)}^Q are used to distinguish the full-range 
coefficients from the half-range ones. The significance of this change lies in the overwhelming 
simplicity of the full-range weight function fi as compared to the half-range function W{fj,), and 
the resulting simplicity of the orthogonality relations that follow, see Appendix A4. The b asic 
data of zq and X^—v) are then completely generated self-consistently ISenguptal dlQsi. Il995h by 
the discretized spectral approximation from the full-range adaption 



N 



J^6i</.,(/i,i/i) = V+(/u) + ^-(/i), /i e [-1,1], Mi > (69) 



1=0 



of the discretized boundary condition Eq. (|68|). where ip+{fJ-) is by definition the incident flux 
ipifi) for G [0, 1] and if ;U G [-1,0], while 



N 



</.e(/i,i/i) if /i G [-1,0], Mi > 

j=0 

if /i G [0, 1] 



is the the emergent angular distribution out of the medium. Equation (|69|) corresponds to the 
full-range /i G [— 1, 1], Mj > form 

6(Mo)(/>(/i,Mo) + / b{i^)(l){fi,L')diy = ilj+ifi) + (b''{i^Q)(j){fi,i^o) + b' {v)(l){^,u)dv\ (70) 



JO \ JO / 

of boundary condition (|131() with the first and second terms on the right having the same 
interpretation as for Eq. (|69|) . This full-range simulation merely states that the solution (|13()|) 
of Eq. (|125|) holds for all // G [— 1, 1], x > 0, although it was obtained, unlike in the regular 
full-range case, from the given radiation V(m) incident on the boundary at x = over only 
half the interval ^ G [0, 1]. To obtain the simulated full-range coefficients and {6,^} of the 
half-range problem, we observe that there are effectively only half the number of coefficients as 
compared to a normal full-range problem because v is now only over half the full interval. This 
allows us to generate two sets of equations from (|70|) by integrating with respect to G [— 1, 1] 
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with V in the half intervals [—1,0] and [0,1] to obtain the two sets of coefficients h and h 
respectively. Accordingly we get from Eq. with j = 0, 1, • • • , the sets of equations 



(V',,/.,-_)W = -^5r(0,+ , 

/ N 




(71) 



where {4>j±)jLi represents ((/>e(/i, ±z/'j))^;^, (?l)o± = (^(/i, ±z^o)) the (+) (— ) superscripts are used 
to denote the integrations with respect to /x G [0,1] and G [—1,0] respectively, and {f,g)fi 
denotes the usual inner product in [—1, 1] with respect to the full range weight fi. While the first 
set of + 1 equations give , the second set produces the required bj from these "negative" 
coefficients. By equating these calculated bi with the exact half-range expressions for a(z/) 
with respect to W{fi) as outlined in Appendix A4, it is possible to find numerical values of zq 
and X{—u). Thus from the second of Eq. ()136() . {A(— fj)}^]^ is obtained with hiB = aiB, 
i = 1, - ■ ■ ,N, which is then substituted in the second of Eq. (|ll'{5j) with X^—uq) obtained from 
aA(i^o) according to Appendix A4, to compare the respective with the calculated biA from 
(|71|) . Finally the full-range coefficients of Problem A can be used to obtain the X{—i') values 
from the second of Eqs. ()135() and compared with the exact tabulated values as in Table |21 
The tabulated values of czq from Eq. (|67|) show a consistent deviation from our calculations 
of Problem A according to a^ii^o) = — exp(— 2zo/i^o)- Since the X(—u) values of Problem A 
in Table El also need the same 6oA as input that was used in obtaining zq , it is reasonable to 
conclude that the "exact" numerical integration of zq is inaccurate to the extent displayed in 
Table m 



c 


CZq 


N = 2 


A^ = 6 


A^ = 10 


Exact 


0.2 


0.78478 


0.78478 


0.78478 


0.7851 


0.4 


0.72996 


0.72996 


0.72996 


0.7305 


0.6 


0.71535 


0.71536 


0.71536 


0.7155 


0.8 


0.71124 


0.71124 


0.71124 


0.7113 


0.9 


0.71060 


0.71060 


0.71061 


0.7106 



Table 2: Extrapolated end-point zq. 



Prom these numerical experiments and Fig. ^Jwe may conclude that the continuous spec- 
trum [—1,1] of the position operator fi acts as the 2?+ points in generating the multifunctional 
Case singular eigenfunction cj){ii,u). Its rational approximation (pi;{fi,u) in the context of the 
simple simulated full-range computations of the complex half-range exact theory of Appendix 
A4, clearly demonstrates the utility of graphical convergence of sequence of functions to multi- 
function. The totality of the multifunctions (/>(//, v) for all z/ in Fig. 1181^ ^ and (d) endows the 
problem with the character of maximal ill-posedness that is characteristic of chaos. This chaotic 
signature of the transport equation is however latent as the experimental output ^{x, fi) is well- 
behaved and regular. This important example shows how nature can use hidden and complex 
chaotic substates to generate order through a process of superposition. 



6. Does Nature support complexity? 
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c 


AT 
i V 






Problem A 


Problem B 


Exact 


0.2 


2 


0.2133 


0.8873091 


0.8873091 


0.887308 


0.7887 


0.5826001 


0.5826001 


0.582500 






0.0338 


1.3370163 


1.3370163 


1.337015 






0.1694 


1.0999831 


1.0999831 


1.099983 


0.6 


6 


0.3807 


0.8792321 


0.8792321 


0.879232 


0.6193 


0.7215240 


0.7215240 


0.721524 






0.8306 


0.6239109 


0.6239109 


0.623911 






0.9662 


0.5743556 


0.5743556 


0.574355 






0.0130 


1.5971784 


1.5971784 


1.597163 






0.0674 


1.4245314 


1.4245314 


1.424532 






0.1603 


1.2289940 


1.2289940 


1.228995 






0.2833 


1.0513750 


1.0513750 


1.051376 


0.9 


10 


0.4255 


0.9058140 


0.9058410 


0.905842 


0.5744 


0.7934295 


0.7934295 


0.793430 






0.7167 


0.7102823 


0.7102823 


0.710283 






0.8397 


0.6516836 


0.6516836 


0.651683 






0.9325 


0.6136514 


0.6136514 


0.613653 






0.9870 


0.5933988 


0.5933988 


0.593399 



Table 3: X{~v) by the full range method. 
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The question of this section is basic in the hght of the theory of chaos presented above as it 
may be reformulated to the inquiry of what makes nature support chaoticity in the form of 
increasing non-injectivity of an input-output system. It is the purpose of this Section to exploit 
the connection between spectral theory and the dynamics of chaos that has been presented 
in the previous section. Since linear operators on finite dimensional spaces do not possess 
continuous or residual spectra, spectral theory on infinite dimensional spaces essentially involves 
limiting behaviour to infinite dimensions of the familiar matrix eigenvalue-eigenvector problem. 
As always this means extensions, dense embeddings and completions of the finite dimensional 
problem that show up as generali zed eigenvalues and eigenvectors. In its usual form, the goal of 
nonlinear spectral theory consists lAppel et al. in the study of ^ for nonlinear operators 



Tx that satisfy more general continuity conditions, like differentiability and Lipschitz continuity, 
than simple boundedness that is enough for linear operators. The following generalization of 
the concept of the spectrum of a linear operator to the nonlinear case is suggestive. For a 
nonlinear map, A need not appear only in a multiplying role, so that an eigenvalue equation can 
be written more generally as a fixed-point equation 

/(A;x) = X 

with a fixed point corresponding to the eigenfunction of a linear operator and an "eigenvalue" 
being the value of A for which this fixed point appears. The correspondence of the residual 
and continuous parts of the spectrum are, h owever, l ess tr ivial than for the point spectrum. 
This is seen from the following two examples. Let Ack = XkGk, k = 1,2, ■ ■ ■ be 



an eigenvalue equation with ej being the j^^ unit vector. Then {A — X)ek ■= {Xk — ^)^k = 
iff A = Afc so that {A^}^^ e PcriA) are the only eigenvalues of A. Consider now {Xk)kLi 
to be a sequence of real numbers that tends to a finite A*; for example let ^ be a diagonal 
matrix having 1/A; as its diagonal entries. Then A* belongs to the continuous spectrum of A 
because {A — X*)ek = {Xk — X*)ek with A^ — > A* implies that {A — X*)~^ is an unbounded linear 
operator and A* a generalized eigenvalue of A. In the second example Aek = ek+i/{k + 1), it 
is not difficult to verify that: (a) The point spectrum of A is empty, (b) The range of A is 
not dense because it does not contain ei, and (c) A~^ is unbounded because Ack — > 0. Thus 
the generalized eigenvalue A* = in this case belongs to the residual spectrum of A. In either 
case, limj_>oo &j is the corresponding generalized eigenvector that enlarges the trivial null space 
M{£jx*) of the generalized eigenvalue A*. In fact in these two and the Dirac delta example of 
Sec. 5 of continuous and residual spectra, the generalized eigenfunctions arise as the limits of a 
sequence of functions whose images under the respective C\ converge to 0; recall the definition 
of footnote 1261 This observation generalizes to the dense extension Multi|(X, y) of Map(X, y) 
as follows. 11 X £ is not a fixed point of /(A; x) = x, but there is some n S N such that 
f^(X;x) = X, then the limit n — > oo generates a multifunction at x as was the case with the 
delta function in the previous section and the various other examples that we have seen so far 
in the earlier sections. 

One of the main goals of investigations on the spectrum of nonlinear operators is to find a set 
in the complex plane that has the usu al desirable properties of the spectrum of a linear operator, 
Appel. DePascale and Vigno In this case, the focus has been to find a suitable class 



of operators C{X) with T £ C{X), such that the resolvent set is expressed as 

p{T) = {A G C: (Ta is 1 : l){Cl{n{Tx) = X) and (T^^ G C{X) on n{Tx))} 

with the spectrum cr(T) being defined as the complement of this set. Among the classes C{X) 
that have been considered, beside spaces of continuous functions C{X), are linear boundedness 
B{X), Frechet differentiability C^{X), Lipschitz continuity Lip{X), and Granas quasibounded- 
ness Q{x), where Lip{X) specifically takes into account the nonlinearity of T to define 

\\T{x)-T{y)\\ . ||T(x)-r(^)|| 
r Lip = sup 7 7 , T iip = mf (72) 

x^y \\x-y\\ x^y \\x-y\\ 
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that are plainly generalizations of the corresponding norms of linear operators. Plots of (y) 
{x e V{f - A) : (/ - X)x = y} for the functions / : M ^ M 



/Aa(a:) 




—Xx, 
(1 - Xx) - 1, 
1 — Ax, 



X < -1 

-1 < X < 1 
1 < X, 

X < 1 

1 < X < 2 

2 < X 



fxd{x) 

fXe{x) 

fxiix) 



—Xx 
\/x — 1 — Xx 



X < 1 
1 < X, 



(x - 1)2 + 1 - Ax 
(1 - A)x 



tan ^(x) — Ax, 

1 — 2\J —X — Ax, 

(l-A)x, 
2-y/x — 1 — Ax, 



1 < X < 1 
otherwise 



X < -1 
-1 < X < 1 

1 < X 



taken from lAppel. DePascale and Vignolil (HooO) are shown in Fig. 1191 It is easy to verify that 
the Lipschitz and linear upper and lower bounds of these maps are as in Table |^ 
The point spectrum defined by 

Pcj(/) = {A G C: (/ - A)x = for some x / 0} 

is the simplest to calculate. Because of the special role played by the zero element in gener- 
ating the point spectrum in the linear case, the bounds m||x|| < ||£^x|| < M||x|| together with 
Lx = Ax imply Cl(P(j(iL)) = [||£j||b5 II'^IIb] — where the subscripts denote the lower and upper 
bounds in Eq. ()72() and which is sometimes taken to be a descriptor of the point spectrum of 
a nonlinear operator — as can be seen in Table El and verified from Fig. I19L The remainder 
of the spectrum, as the complement of the resolvent set, is more difficult to find. Here the 
convenient characterization of the resolvent of a continuous linear operator as the set of all 
sufficiently large A that satisfy |A| > M is of little significance as, unlike for a linear operator, 
the non-existence of an inverse is not just due the set {/~^(0)} which happens to be the only 
way a linear map can fail to be injective. Thus the map defined piecewise as a -|- 2(1 — a)x 
for < X < 1/2 and 2(1 — x) for 1/2 < x < 1, with < a < 1, is not invertible on its range 
although {/~(0)} = 1. Comparing Fig. IT^ and Table 21 it is seen that in cases (b), (c) and 
(d), the intervals [I/lb, ||/||b] are subsets of the A- values for which the respective maps are not 
injective; this is to be compared with (a), (e) and (f) where the two sets are the same. Thus 
the linear bounds are not good indicators of the uniqueness properties of solution of nonlinear 
equations for which the Lipschitzian bounds are seen to be more appropriate. 

In view of the above, we may draw the following conclusions. If we choose to work in the 
space of multifunctions Multi(X, T), with T the topology of pointwise biconvergence, when all 
functional relations are (multi) invertible on their ranges, we may make the following definition 
for the net of functions /(A;x) satisfying /(A;x) = x. 

Definition 6.1. Let /(A; •) € Multi(X, T) be a function. The resolvent set of f is given by 



p{f) = {A: (/(A; G Map(X, T)) A (C1(7^(/(A; •)) 
and any X not in p is in the spectrum of f. □ 
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Figure 19: Inverses of / — A. The A-values are shown on the graphs. 



Function 


I/lb 


II/IIb 


l/liip 


ll/llLip 


/a 





1 





1 


/b 





1/2 





1 


fc 





1/2 





oo 


/d 


2(^2-1) 


oo 





2 


fc 





1 





1 


ff 





1 





1 



Table 4: Bounds on the functions of Fig. E| 



68 



Functions 


C^Lip(/) 


P<r{f) 


fa. 


[0, 1] 


(0,1] 


fh 


[0, 1] 


[0, 1/2] 


fc 


[0,oo) 


[0, 1/2] 


fd 


[0,2] 


[2(^2 -1),1] 


fc 


[0, 1] 


(0,1) 


fi 


[0, 1] 


(0,1) 



Table 5: Lipschitzian and point spectra of the functions of Fig. El 



Thus apart from multifunctions, A G o'(/) also generates functions on the boundary of 
functional and non- functional relations in Multi(X, T). While it is possible to classify the 
spectrum into point, continuous and residual subsets, as in the linear case, it is more meaningful 
for nonlinear operators to consider A as being either in the boundary spectrum Bdy((T(/)) or 
in the interior spectrum Int((T(/)), depending on whether or not the multifunction /(A; •)" 
arises as the graphical limit of a net of functions in either p{f) or Ra{f). This is suggested by 
the spectra arising from the second row of Tabled (injective Cx and discontinuous ^x^) that 
lies sand wiched in the A-plane between the two components arising from the first and third 
Navlor and Selll (1971) Sec. 6.6, for example. According to this simple scheme, the 



rows, see 



spectral set is a closed set with its boundary and interior belonging to Bdy((T(/)) and Int(cj(/)) 
respectively. Table 1^1 shows this division for the examples in Fig. 1191 Because is no more 
significant than any other point in the domain of a nonlinear map in inducing non-injectivity, 
the division of the spectrum into the traditional sets would be as shown in Table IHl compare also 
with the conventional linear point spectrum of TableEl In this nonlinear classification, the point 
spectrum consists of any A for which the inverse /(A; •)" is set-valued, irrespective of whether 
this is produced at or not, while the continuous and residual spectra together comprise the 
boundary spectrum. Thus a A can be both in the point and the continuous or residual spectra 
which need not be disjoint. The continuous and residual spectra are included in the boundary 
spectrum which may also contain parts of the point spectrum. 



Function 


Int(a(/)) 


Bdy(a(/)) 


P<f) 


CTif) 


R<f) 


/a 


(0,1) 


{0,1} 


[0,1] 


{1} 


{0} 


fh 


(0,1) 


{0,1} 


[0,1] 


{1} 


{0} 


fc 


(0,oo) 


{0} 


[0,cx)) 


{0} 





fd 


(0,2) 


{0,2} 


(0,2) 


{0,2} 





fc 


(0,1) 


{0,1} 


(0,1) 


{1} 


{0} 


fi 


(0,1) 


{0,1} 


(0,1) 


{0,1} 






Table 6: Nonlinear spectra of functions of Fig. E| Compare the present point spectra with the usual 
linear spectra of Table El 

Example 6.1. To see how these concepts apply to linear mappings, consider the equation 
{D — A)y(x) = r{x) where D = d/dx is the differential operator on L^[0,oo), and let A be real. 
For A 7^ 0, the unique solution of this equation in L^[0, oo), is 



y{x) = < 



y(o) + 
y{o)- 



r{x')dx , A < 



e~^^V(xVx' ) , A > 
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showing that for A > the inverse is functional so that A G (0, cxd) belongs to the resolvent of 
D. However, when A < 0, apart from the y = solution (since we are dealing a with linear 
problem, only r = is to be considered), e^^ is also in L^[0, oo) so that all such A are in 
the point spectrum of D. For A = and r ^ 0, the two solutions are not necessarily equal 
unless r{x) = 0, so that the range TZ(D — I ) is a subspace of L^[0, oo). To complete the 
problem, it is possible to show Navlor and Selll ( 197lh that S Ca(D), see Ex. 2.2; hence 
the continuous spectrum forms at the boundary of the functional solution for the resolvent-A 
and the multifunctional solution for the point spectrum. With a slight variation of problem 
to y{0) = 0, all A < are in the resolvent set, while A > the inverse is bounded but must 
satisfy y(0) = f^^ e-^''r{x)dx = so that Cl{n{D - A)) / L'^[0,oo). Hence A > belong to 
the residual spectrum. The deco mposition of the c ompl ex A-plane for these and some other 
linear spectral problems taken from Navlor and Selll (jl97 j ) is shown in Fig. 1201 In all cases, the 
spectrum due to the second row of Table ^ acts as a boundary between that arising from the 
first and third rows, which justifies our division of the spectrum for a nonlinear operator into 
the interior and boundary components. Compare Example 2.2. ■ 




Figure 20: Spectra of some linear operators in the A-plane. (a) Left shift (• • • ,x-i,xo,xi, ■ ■ ■) i-^ 
(• • • xq, xi,X2, ■ ■ ■) on ^2(— oo, oo), (b) Right shift {xq, xi, X2, ■ ■ ■) ^ (0, xq, xi,- ■ ■) on ^2[0, oo), (c) Left 
shift (xo, xi, X2^ ■ ■ ■) ^ (xi, X2, X3, • • • ) on l2[0, oo) of sequence spaces, and (d) d/dx on i2(~oo, oo) (e) 
d/dx on ^2(0, 00) with y{0) = and (f) d/dx on L2[0, 00). The residual spectrum in (b) and (e) arise from 
block (3-3) in Tabled i-S-, is one-to-one and C^^ is bounded on non-dense domains in /2[0, 00) and 
L2[0,c») respectively. The continuous spectrum therefore marks the boundary between two functional 
states, as in (a) and (e), now with dense and non-dense domains of the inverse operator. 

Prom the basic representation of the resolvent operator (1 — /)~^ 

1 + / + /' + ••• + r + •• • 

in Multi(X), if the iterates of / converge to a multifunction for some A, then that A must be in 
the spectrum of /, which means that the control parameter of a chaotic dynamical system is in 
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its spectrum. Of course, the series can sum to a multi even otherwise: take fx{x) to be identically 
X with A = 1, for example, to get 1 G Pa{f). A comparison of Tables Hand El reveal that in case 
(d), for example, and 2 belong to the Lipschtiz spectrum because although f^^ is not Lipschitz 
continuous, ||/||Lip = 2. It should also be noted that the boundary between the functional 
resolvent and multifunctional spectral set is formed by the graphical convergence of a net of 
resolvent functions while the multifunctions in the interior of the spectral set evolve graphically 
independent of the functions in the resolvent. The chaotic states forming the boundary of the 
functional and multifunctional subsets of Multi(X) marks the transition from the less efficient 
functional state to the more efficient multifunctional one. 

These arguments also suggest the following. The countably many outputs arising from the 
non-injectivity of /(A; •) corresponding a given input can be interpreted to define complexity 
because in a nonlinear system each of these possibilities constitute a experimental result in itself 
that may not be combined in any definite predetermined manner. This is in sharp contrast to 
linear systems where a linear combination, governed by the initial conditions, always generate a 
unique end result; recall also the combination offered by the singular generalized eigenfunctions 
of neutron transport theory. This multiplicity of possibilities that have no definite combinato- 
rial property is the ba sis of the div e rsitv of nature, and is possibly responsible for Feigenbaum's 
"historical prejudice", FeigenbaumI ( 19921 ^. see Prelude, 2. Thus order represented by the func- 
tional resolvent passes over to complexity of the countably multifunctional interior spectrum 
via the uncountably multifunctional boundary that is a prerequisite for chaos. We may now 
strengthen our hypothesis offered at the end of the previous section in terms of the examples of 
Figs. 1191 and Oni that nature uses chaoticity as an intermediate step to the attainment of states 
that would otherwise be inaccessible to it. Well-posedness of a system is an extremely inefficient 
way of expressing a multitude of possibilities as this requires a different input for every possible 
output. Nature chooses to express its myriad manifestations through the multifunctional route 
leading either to averaging as in the delta function case or to a countable set of well-defined 
states, as in the examples of Fig. corresponding to the interior spectrum. Of course it is no 
distraction that the multifunctional states arise respectively from fx and f^ in these examples 
as / is a function on X that is under the influence of both / and its inverse. The functional 
resolvent is, for all practical purposes, only a tool in this structure of nature. 

The equation /(x) = y is typically an input-output system in which the inverse images at 
a functional value yo represents a set of input parameters leading to the same experimental 
output yoj this is stability characterized by a complete insensitivity of the output to changes in 
input. On the other hand, a continuous multifunction at xq is a signal for a hypersensitivity to 
input because the output, which is a definite experimental quantity, is a choice from the possibly 
infinite set {/(a^o)} made by a choice function which represents the experiment at that particular 
point in time. Since there will always be finite differences in the experimental parameters when 
an experiment is repeated, the choice function (that is the experimental output) will select a 
point from {/(xq)} that is representative of that experiment and which need not bear any definite 
relation to the previous values; this is instability and signals sensitivity to initial conditions. 
Such a state is of high entropy as the number of available states fc{{f{xQ)}) — where /c is the 
choice function — is larger than a functional state represented by the singleton {/(xq)}. 



Epilogue 

The most passionate advocates of the new science go so far as to say that twentieth-century science 
will be remembered for just three things: relativity, quantum mechanics and chaos. Chaos, they 
contend, has become the century's third great revolution in the physical sciences. Like the first two 
revolutions, chaos cuts away at the tenets of Newton's physics. As one physicist put it: "Relativity 
eliminated the Newtonian illusion of absolute space and time; quantum theory eliminated the New- 
tonian dream of a controllable measurement process; and chaos eliminates the Laplacian fantasy of 
deterministic predictability. " Of the three, the revolution in chaos applies to the universe we see and 
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touch, to objects at human scale. ■ ■ ■ There has long been a feeling, not always expressed openly, 
that theoretical physics has strayed far from human intuition about the world. Whether this will 
prove to be fruitful heresy, or just plain heresy, no one knows. But some of those who thought that 
physics might be working its way into a corner now look to chaos as a way out. 



Appendix 

This Appendix gives a brief overview of some aspects of topology that are necessary for a proper 
understanding of the concepts introduced in this work. 



Al. Convergence in Topological Spaces: Sequence, Net and Filter. 

In the theory of convergence in topological spaces, countability plays an important role. To 
understand the significance of this concept, some preliminaries are needed. 

The notion of a basis, or base, is a familiar one in analysis: a base is a subcollection of 
a set which may be used to construct, in a specified manner, any element of the set. This 
simplifies the statement of a problem since a smaller number of elements of the base can be 
used to generate the larger class of every element of the set. This philosophy finds application 
in topological spaces as follows. 

Among the three properties (Nl) — (N3) of the neighbourhood system 7V^. of Tutorial4, (Nl) 
and (N2) are basic in the sense that the resulting subcollection of Mx can be used to generate 
the full system by applying (N3); this basic neighbourhood system, or neighbourhood (local) base 
Bx at X, is characterized by 

(NBl) X belongs to each member B oi Bx- 

(NB2) The intersection of any two members of Bx contains another member of Bx- Bi,B2 G 
Bx^{3BeBx:BCBir\B2)- 
Formally, compare Eq. (|18j) . 

Definition Al.l. A neighhourhood (local) base Bx at x in a topological space {X,U) is a 
subcollection of the neighbourhood system Nx having the property that each N € Nx contains 
some member of Bx- Thus 

Bx=^{BeJ\fx:xeBCNioT each N G Mx} (73) 
determines the full neighbourhood system 

Mx = {N C X: X £ B C N ior some B £ Bx} (74) 
reciprocally as all supersets of the basic elements. □ 

The entire neighbourhood system Mx , which is recovered from the base by forming all supersets 
of the basic neighbourhoods, is trivially a local base at x; non-trivial examples are given below. 

The second example of a base, consisting as usual of a subcollection of a given collection, is 
the topological base tB that allows the specification of the topology on a set X in terms of a 
smaller collection of open sets. 

Definition A1.2. A base tB in a topological space {X,U) is a subcollection of the topology 
U having the property that each U £U contains some member of tB. Thus 

tB =^ {B eU: B for each U eU} (75) 
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determines reciprocally the topology U as 

U = < U (^X:U = b\ □ (76) 

This means that the topology on X can be reconstructed form the base by taking all possible 
unions of members of the base, and a collection of subsets of a set X is a topological base iff Eq. 
(|7(i|) of arbitrary unions of elements of "[B generates a topology on X. This topology, which is 
the coarsest (that is the smallest) that contains t-S, is obviously closed under finite intersections. 
Since the open set Int(A^) is a neighbourhood of x whenever is, Eq. (|74j) and the definition 
Eq. ()17() of Nx implies that the open neighbourhood system of any point in a topological space 
is an example of a neighbourhood base at that point, an observation that has often led, together 
with Eq. (|75() . to the use of the term "neighbourhood" as a synonym for "non-empty open set". 
The distinction between the two however is significant as neighbourhoods need not necessarily 
be open sets; thus while not necessary, it is clearly sufficient for the local basic sets B to be 
open in Eqs. ((73]) and (f7i]) . If Eq. (f7i|) holds for every x G N, then the resulting Mx reduces to 
the topology induced by the open basic neighbourhood system Bx as given by Eq. ((TH|) . 

In order to check if a collection of subsets tB of X qualifies to be a basis, it is not necessary 
to verify properties (Tl) — (T3) of Tutorial4 for the class (f7f)|) generated by it because of the 
properties (TBI) and (TB2) below whose strong affinity to (NBl) and (NB2) is formalized in 
Theorem A 1.1. 

Theorem Al.l. A collection ^B of subsets of X is a topological basis on X iff 

(TBI) X = {JbgtB '^^^^^ each x £ X must belong to some B G tB which implies the 

existence of a local base at each point x £ X . 

(TB2) The intersection of any two members Bi and B2 of t^B with x £ Si P| i?2 contains 

another member of ^B: {Bi,B2 G tB)A{x G Bif]B2) ^ {3B e tB-.x^BC Bifl-Ba). □ 

This theorem, together with Eq. (|76|) ensures that a given collection of subsets of a set X 
satisfying (TBI) and (TB2) induces some topology on X; compared to this is the result that 
any collection of subsets of a set X is a subbasis for some topology on X. If X, however, already 
has a topology U imposed on it, then Eq. (f7^ must also be satisfied in order that the topology 
generated by tB is indeed U. The next theorem connects the two types of bases of Defs. Al.l 
and A1.2 by asserting that although a local base of a space need not consist of open sets and 
a topological base need not have any reference to a point of X, any subcollection of the base 
containing a point is a local base at that point. 

Theorem A1.2. A collection of open sets tB is a base for a topological space {X,U) iff for 
each X G X, the subcollection 

Bx = {BeU:xeBeTB} (77) 
of basic sets containing x is a local base at x. □ 

Proof. Necessity. Let tB be a base of {X,U) and A^ be a neighbourhood of x, so that x £ U N 
for some open set U = Ubgt^ ^ ^^'^ basic open sets B. Hence x & B C N shows, from Eq. 
(|73|) . that B £ Bx is a local basic set at x. 

Sufficiency. If U is an open set of X containing x, then the definition of local base Eq. ((75|) 
requires x £ Bx U for some subcollection of basic sets Bx in Bx] hence U = {J^f^jj Bx. By 
Eq. ((7H)) therefore, tB is a topological base for X. ■ 

Because the basic sets are open, (TB2) of Theorem Al.l leads to the following physically 
appealing paraphrase of Thm. A 1.2. 

Corollary. A collection t-S of open sets of {X,U) is a topological base that generates lA iff for 
each open set U of X and each x £ U there is an open set B £ t^B such that x £ B C U; that 
is iff 

X £U £U ^ {3B £ tB:x £ B <ZU). □ 



73 



Example Al.l. Some examples of local bases in M are intervals of the type {x — £,x -\- e), 
[x — £,x + e] for real e, {x — q,x + q) for rational q, {x — l/n,x + 1/n) for n G Z+, while for a 
metrizable space with the topology induced by a metric d, each of the following is a local base 
at X e X: B^{x;d) := {y £ X : d{x,y) < e} and D^{x]d) := {y G X : d{x,y) < e} for e > 0, 
Bq{x;d) for Q 9 g > and for n G Z+. In M^, two neighbourhood bases at any 

X G are the disks centered at x and the set of all squares at x with sides parallel to the axes. 
Although these bases have no elements in common, they are nevertheless equivalent in the sense 
that they both generate the same (usual) topology in M^. Of course, the entire neighbourhood 
system at any point of a topological space is itself a (less useful) local base at that point. By 
Theorem Al.2, Be{x;d), De{x;d), e > 0, Bg{x;d), Q 3 q > and Bi/n{x;d), n G Z+, for ah 
X £ X are examples of bases in a metrizable space with topology induced by a metric d. □ 

In terms of local bases and bases, it is now possible to formulate the notions of first and 
second countability as follows. 

Definition A1.3. A topological space is first countable if each x € X has some countable 
neighbourhood base, and is second countable if it has a countable base. □ 

Every metrizable space (A, d) is first countable as both {B{x, q)}Q^q>o and {B{x, l/7T,)}„gz+ 
are examples of countable neighbourhood bases at any x G (A, d); hence M" is first countable. It 
should be clear that although every second countable space is first countable, only a countable 
first countable space can be second countable, and a common example of a uncountable first 
countable space that is also second countable is provided by M". Metrizable spaces need not be 
second countable: any uncountable set having the discrete topology is as an example. 

Example Al.2. The following is an important example of a space that is not first countable 
as it is needed for our pointwise biconvergence of Section 3. Let Map (A, y) be the set of all 
functions between the uncountable spaces {X,U) and {Y, V). Given any integer / > 1, and any 
finite collection of points {xi)l^^ of A and of open sets {Vi)l^^ in Y, let 

B{{xi)U;{Vi)U) = {<7 G Map(A,y): {g{xi) G Vi){i = l,2,--- ,1)] (78) 

be the functions in Map(A, y) whose graphs pass through each of the sets {Vi)l^i at (2;i)f=i> 
and let be the collection of all such subsets of Map(A, y) for every choice of /, {xi)l^i, 
and {yi)l^]^. The existence of a unique topology T — the topology of pointwise convergence on 
Map(A, y) — that is generated by the open sets B of the collection t-S now follows because 

(TBI) is satisfied: For any / G Map(A, y) there must be some x G A and a corresponding 
F C y such that f{x) G V, and 

(TB2) is satisfied because 

B{{s,)U; {Vi)U) n i?((t,)/=i; (W^,)/=i) = B{{s,)U, (t,)/=i; (y,)f=i, (W^,)/=i) 

implies that a function simultaneously belonging to the two open sets on the left must pass 
through each of the points defining the open set on the right. 

We now demonstrate that (Map(A, y),T) is not first countable by verifying that it is not 
possible to have a countable local base at any / G Map(A, y). If this is not indeed true, let 
^/((^i)f=ii (^)f=i) = {5 ^ Map(A, y) : {g{xi) G Vi)l^^}, which denotes those members of 
that contain / with Vi an open neighbourhood of /(xj) in Y, be a countable local base at /, see 
Thm. Al.2. Since A is uncountable, it is now possible to choose some x* G A different from any 
of the {xi)l^i (for example, let x* G M be an irrational for rational (xj)J ), and let f{x*) G V* 
where V* is an open neighbourhood of f{x*). Then B{x*;V*) is an open set in Map(A, y) 
containing /; hence from the definition of the local base, Eq. H73|l. or equivalently from the 
Corollary to Theorem Al.2, there exists some (countable) / G N such that f € B^ C B{x*;V*). 
However, 

yi ^Vi, \i X = Xi, and 1 < i < / 
f*{x) = ly*^V* \ix = x* 
^ arbitrary, otherwise 
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is a simple example of a function on X that is in (as it is immaterial as to what values 
the function takes at points other than those defining B^), but not in B{x*; V*). From this it 
follows that a sufficient condition for the topology of pointwise convergence to be first countable 
is that X be countable. ■ 

Even though it is not first countable, (Map(X, y),T) is a Hausdorff space when Y is Haus- 
dorff. Indeed, 'd f,g & (Map(X, Y),T) with f ^ g, then f{x) 7^ g{x) for some x £ X. But then 
as Y is Hausdorff, it is possible to choose disjoint open intervals Vf and Vg at /(x) and g{x) 
respectively. 

With this background on first and second countability, it is now possible to go back to the 
question of nets, filters and sequences. Technically, a sequence on a set X is a map x : N — > X 
from the set of natural numbers to X; instead of denoting this is in the usual functional manner 
of x{i) with i S N, it is the standard practice to use the notation (xj)jgN for the terms of a 
sequence. However, if the space {X,U) is not first countable (and as seen above this is not a 
rare situation) , it is not difficult to realize that sequences are inadequate to describe convergence 
in X simply because it can have only countably many values whereas the space may require 
uncountably many neighbourhoods to completely define the neighbourhood system at a point. 
The resulting uncountable generalizations of a sequence in the form of nets and filters is achieved 
through a corresponding generalization of the index set N to the directed set D. 

Definition A1.4. A directed set B is a preordered set for which the order ^, known as a 
direction of B, satisfies 

(a) Q G B =^ a ^ a (that is ^ is reflexive). 

(b) a, /3, 7 G B such that (q;^/3A/3^7) =^0^7 (that is ^ is transitive). 

(c) a, /3 G B ^ 37 G B such that (a ^ 7) A (/3 ^ 7). □ 

While the first two properties are obvious enough and constitutes the preordering of B, the third 
which replaces antisymmetry, ensures that for any finite number of elements of the directed set 
(recall that a preordered set need not be fully ordered), there is always a successor. Examples 
of directed sets can be both straight forward, as any totally ordered set like N, M, Q, or Z and 
all subsets of a set X under the superset or subset relation (that is (V^X),^) or (P(X),C) 
that are directed by their usual ordering, and not quite so obvious as the following examples 
which are significantly useful in dealing with convergence questions in topological spaces, amply 
illustrate. 

The neighbourhood system 

oN = {N: N 

at a point x £ X, directed by the reverse inclusion direction ^ defined as 

M N C M for M, N G TV^, (79) 

is a fundamental example of a natural direction of Mx- In fact while reflexivity and transitivity 
are clearly obvious, (c) follows because for any M,N G Mx, M ^ Mf^N and N ^ Mf]N. 
Of course, this direction is not a total ordering on Mx- A more naturally useful directed set in 
convergence theory is 

oNt = {{N,t):{NGMx){t£N)} (80) 

under its natural direction 

(M, s) ^ (iV, t) N CM for Af, N £ Mx] (81) 

oNf is more useful than oN because, unlike the later, oNt does not require a simultaneous 
choice of points from every N G Mx that implicitly involves a simultaneous application of the 
Axiom of Choice; see Examples Al.2(2) and (3) below. The general indexed variation 

„Nf3 = {{N,f3): {N G Mx){f3 G B){xf3 G N)} (82) 
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of Eq. ()8U() , with natural direction 

(M, a) < {N, /?) ^ (a ^ /3) A (iV C M), (83) 

often proves useful in applications as will be clear from the proofs of Theorems Al.3 and Al.4. 

Definition A1.5. Net. Let X be any set and B a directed set. A net x'-^^X in X is a 

function on the directed set D with values in X. □ 

A net, to be denoted as a £ B, is therefore a function indexed by a directed set. We 

adopt the convention of denoting nets in the manner of functions and do not use the sequential 
notation Xa that can also be found in the literature. Thus, while every sequence is a special 
type of net, x — *■ A" is an example of a net that is not a sequence. 

Convergence of sequences and nets are described most conveniently in terms of the notions 
of being eventually in and frequently in every neighbourhood of points. We describe these 
concepts in terms of nets which apply to sequences with obvious modifications. 

Definition A1.6. A net %: D — > A is said to he 

(a) Eventually in a subset A of X if its tail is eventually in A: (3/3 G ID) : (V7 ^ /?)(x(7) ^ A). 

(b) Frequently in a subset A of X if for any index P GO, there is a successor index 7 G D 
such that xil) is in A: (V/3 G B)(37 ^ (3) : (^(7) G A). □ 

It is not difficult to appreciate that 

(i) A net eventually in a subset is also frequently in it but not conversely, 

(ii) A net eventually (respectively, frequently) in a subset cannot be frequently (respectively, 
eventually) in its complement. 

With these notions of eventually in and frequently in, convergence characteristics of a net 
may be expressed as follows. 

Definition A1.7. A net x'-^ ^ X converges to x £ X if it is eventually in every neighbour- 
hood of X, that is 

(VAgAA,)(3^GB)(x(z^^/u)g A). 
The point x is known as the limit of x 8t,nd the collection of all limits of a net is the limit set 

lim(x) = {x G A: (VA G M^){3R^ G Res(B))(x(M;5) C A)} (84) 

of x, with the set of residuals Res(B) in B given by 

Res(B) = {Ma G ■P(B) : M^. = {/? G B for ah /? ^ a G B}}. (85) 

The net adheres at x £ A^^ if it is frequently in every neighbourhood of x, that is 

((VA G AA,.)(V^ G B))((3i/ y fi) : x{i^) G N). 

The point x is known as the adherent of x s.nd the collection of all adherents of x is the adherent 
set of the net, which may be expressed in terms of the cofinal subset of B 

Cof(B) = {C„ G P(B) : Cc, = {/? G B for some /? ^ a G B}} (86) 

(thus Bq is cofinal in B iff it intersects every residual in B), as 

adh(x) = {x G A: (VA G N.,){3Cp G Cof(B))(x(C;3) ^ A)}. (87) 

This recognizes, in keeping with the limit set, each subnet of a net to be a net in its own right, 
and is equivalent to 

adh(x) = {2;G A: (VAgA4)(VM„ GRes(B))(x(Ma)n^^/0)}- □ (88) 

^^This is also known as a duster point; we shall, however, use this new term exclusively in the sense of the 
elements of a derived set, see Definition 2.3. 
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Intuitively, a sequence is eventually in a set A if it is always in it after a finite number of 
terms (of course, the concept of a finite number of terms is unavailable for nets; in this case 
the situation may be described by saying that a net is eventually in A if its tail is in A) and 
it is frequently in A if it always returns to A to leave it again. It can be shown that a net is 
eventually (resp. frequently) in a set iff it is not frequently (resp. eventually) in its complement. 

The following examples illustrate graphically the role of a proper choice of the index set D 
in the description of convergence. 

Example A1.3. (1) Let 7 S B. The eventually constant net x{^) = ^ ^or S ^ j converges to 

X. 

(2) Let Mx be a neighbourhood system at a point x in X and suppose that the net 
{x{N))NejVa, is defined by 

x(M)'i%GM; (89) 

here the directed index set oN is ordered by the natural direction (|79j) of Mx- Then xi^) ~^ ^ 
because given any x-neighbourhood M E oN, it follows from 

M ^ N £ jiN =^ X{N) = t€ N CM (90) 

that a point in any subset of M is also in M; xi^) is therefore eventually in every neighbourhood 
of X. 

(3) This slightly more general form of the previous example provides a link between the 
complimentary concepts of nets and filters that is considered below. For a point x G X, and 
M, N € Mx with the corresponding directed set oMs of Eq. (jHOJ ordered by its natural order 
(jHU, the net 

x{M,s)''^'s (91) 
converges to x because, as in the previous example, for any given (M, s) £ oNg, it follows from 

(M,s) ^ {N,t) e pM, =^ xiN,t) = t£N<ZM (92) 

that xi^j t) is eventually in every neighbourhood M of x. The significance of the directed set 
oNf of Eq. H8U|) . as compared to dN, is evident from the net that it induces without using the 
Axiom of Choice: For a subset A of X, the net xi^^ t) = t £ A indexed by the directed set 

nNt = {{N,t):{N €j\fx){t£Nf]A)} (93) 

under the direction of Eq. H81|) . converges to x £ X with all such x defining the closure Cl(^) 
of A. Furthermore taking the directed set to be 

M = {{N, t):{N£ Mx){t G iV n ^ - {x})} (94) 

which, unlike Eq. 1)93(1 . excludes the point x that may or may not be in the subset A oi X, 
induces the net x{^-, t) = t £ A — {x} converging to x G X, with the set of all such x yielding 
the derived set Der(A) of A. In contrast, Eq. (|93|) also includes the isolated points t = x of A so 
as to generate its closure. Observe how neighbourhoods of a point, which define convergence of 
nets and filters in a topological space X, double up here as index sets to yield a self-consistent 
tool for the description of convergence. 

As compared with sequences where, the index set is restricted to positive integers, the 
considerable freedom in the choice of directed sets as is abundantly borne out by the two 
preceding examples, is not without its associated drawbacks. Thus as a trade-off, the wide 
range of choice of the directed sets may imply that induction methods, so common in the 
analysis of sequences, need no longer apply to arbitrary nets. 

(4) The non-convergent nets (actually these are sequences) 
(a) (1, —1, 1, —1, • • • ) adheres at 1 and —1 and 
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\ Ti if 72 IS odd 

(h) Xn = s ^ -,,/-, N -r- • , adheres at 1 for its even terms, but is un- 

[ 1 — 1/(1 + nj if n IS even 

bounded in the odd terms. ■ 

A converging sequence or net is also adhering but, as examples (4) show, the converse is 
false. Nevertheless it is true, as again is evident from examples (4), that in a first countable 
space where sequences suffice, a sequence adheres at x iff some subsequence {xn„Jm£N 
of (x„) converges to x. If the space is not first countable this has a corresponding equivalent 
formulation for nets with subnets replacing subsequences as follows. 

Let (x(«))aGiD) be a net. A subnet of is the net = /? G E, where 

a : (E, <) (D, ^) is a function that captures the essence of the subsequential mapping 
n I— > rim in N by satisfying 

(SNl) fj is an increasing order-preserving function: it respects the order of E: cr{P) ^ 
for every /?</?' G E, and 

(SN2) For every a € B there exists a /? G E such that a ^ 
These generalize the essential properties of a subsequence in the sense that (1) Even though the 
index sets B and E may be different, it is necessary that the values of E be contained in B, and 
(2) There are arbitrarily large a G B such that x(ct = cr{f3)) is a value of the subnet ({P) for 
some /3 G E. Recalling the first of the order relations Eq. (|38|) on Map{X,Y), we will denote a 
subnet C of x by C ^ X- 

We now consider the concept of filter on a set X that is very useful in visualizing the 
behaviour of sequences and nets, and in fact filters constitute an alternate way of looking at 
convergence questions in topological spaces. A filter on a set X is a collection of nonempty 
subsets of X satisfying properties (Fl) — (F3) below that are simply those of a neighbourhood 
system Mx without specification of the reference point x. 

(Fl) The empty set does not belong to 

(F2) The intersection of any two members of a filter is another member of the filter: -Fi, -F2 G 

(F3) Every superset of a member of a filter belongs to the filter: {F £ J^)A{F C G) ^ G £ J^; 
in particular X £ T. 

Example A1.4. (1) The indiscrete filter is the smallest filter on X. 

(2) The neighbourhood system Nx is the important neighbourhood filter at x on X, and any 
local base at x is also a filter-base for A^. In general for any subset A oi X^ {N C X : A C 
Int(A^)} is a filter on X at A. 

(3) All subsets of X containing a point x G X is the principal filter fV{x) on X at x. More 
generally, if J- consists of all supersets of a nonempty subset A of X, then is the principal 
filter p'P{A) = {N X : A C Int(A^)} at A. By adjoining the empty set to this filter give the 
p-inclusion and A-inclusion topologies on X respectively. The single element sets {{x}} and 
{A} are particularly simple examples of filter-bases that generate the principal filters at x and 
A. 

(4) For an uncountable (resp. infinite) set X, all cocountable (resp. cofinite) subsets of X 
constitute the cocountable (resp. cofinite or Frechet) filter on X. Again, adding to these filters 
the empty set gives the respective topologies. ■ 

Like the topological and local bases and Bx respectively, a subclass of T may be used to 
define a filter-base pB that in turn generate J- on X, just as it is possible to define the concepts 
of limit and adherence sets for a filter to parallel those for nets that follow straightforwardly 
from Def. A1.7, taken with Def. Al.ll. 

Definition A1.8. Let {X,T) he a topological space and T a Glter on X. Then 

lim(J^) = {x G X: (ViV G Mx){3F e r){F C N)} (95) 

and 

adh(J^^) ={xeX: (ViV G A/;)(VF G J^){Ff]N ^ 0)} (96) 



78 



are respectively the sets of limit points and adherent points of T . □ 

A comparison of Eqs. (|84|) and (|88j) with Eqs. (jHSJ and respectively demonstrate their 
formal similarity; this inter-relation between filters and nets will be made precise in Definitions 
Al.lO and Al.ll below. It should be clear from the preceding two equations that 

lim(J") C adh(:r), (98) 

with a similar result 

lim(x) C adh(x) (99) 

holding for nets because of the duality between nets and filters as displayed by Defs. A1.9 and 
Al.lO below, with the equality in Eqs. (|98j) and (|99j) being true (but not characterizing) for 
ultrafilters and ultranets respectively, see Example 4.2(3) for an account of this notion . It 
should be clear from the equations of Definition A1.8 that 

adh(J^) = {x G X : (3 a finer filter Q ^ T X) {Q ^ x)} (100) 

consists of all the points of X to which some finer filter Q (in the sense that T ^ Q implies 
every element of T is also in Q) converges in X; thus 

adh(.F) =Ulhn(g:g D.^), 

which corresponds to the net-result of Theorem Al.5 below, that a net x adheres at x iff there 
is some subnet of x that converges to a; in X. Thus if ^ x is a subnet of x and .7^ C ^ is a 
filter coarser than Q then 

lim(x;) ^ lim((^) lim(.7^) C lim(^) 
adh(C) C adh(x) adh(g) C adh(J^); 

a filter Q finer than a given filter T corresponds to a subnet C of a given net x- The implication 
of this correspondence should be clear from the association between nets and filters contained 
in Definitions Al.lO and Al.ll. 

A filter-base in X is a nonempty family {Ba)a&i = F-S of subsets of X characterized by 

(FBI) There are no empty sets in the collection pB: (Va G lD))(Ba 7^ 0) 
(FB2) The intersection of any two members of pB contains another member of pB: B^, Bp G 
F^^(3SG F^:SCB„n^/3); 

hence any class of subsets of X that does not contain the empty set and is closed under finite 
intersections is a base for a unique filter on X] compare the properties (NBl) and (NB2) of a 
local basis given at the beginning of this Appendix. Similar to Def. Al.l for the local base, it 
is possible to define 

Definition A1.9. A tilter-hase f-B in a set X is a suhcoUection of the filter T on X having 
the property that each F ^ T contains some memher of pB. Thus 

pB=^{BeJ^:BCF for each F G J^} (101) 

determines the filter 

T = {F <^X:B<ZF ioT some B G f^} (102) 

reciprocally as all supersets of the basic elements. □ 
^^The restatement 

J^^x^Af^CT (97) 

of Eq. H95|l that follows from (F3), and sometimes taken as the definition of convergence of a filter, is 
significant as it ties up the algebraic filter with the topological neighbourhood system to produce the 
filter theory of convergence in topological spaces. From the defining properties of J- it follows that for 
each X & X, Mx is the coarsest (that is smallest) filter on X that converges to x. 
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This is the smallest filter on X that contains f-S and is said to be the filter generated by its 
filter-base pB; alternatively pB is the filter-base of The entire neighbourhood system A4, the 
local base Bx, Mx {^A iov x ^ C\{A)^ and the set of all residuals of a directed set ID are among 
the most useful examples of filter-bases on X, A and D respectively. Of course, every filter is 
trivially a filter-base of itself, and the singletons {{x}}, {A} are filter-bases that generate the 
principal filters fP{x) and fP{A) at x, and A respectively. 

Paralleling the case of topological subbase t^, a filter subbase pS can be defined on X to be 
any collection of subsets of X with the finite intersection property (as compared with where 
no such condition was necessary, this represents the fundamental point of departure between 
topology and filter) and it is not difficult to deduce that the filter generated by pS on X is 
obtained by taking all finite intersections f^a of members of f^ followed by their supersets 
F^SA- J-{fS) := F^EA is the smallest filter on X that contains f^ and is the filter generated 
by fS. 

Equation (|96|) can be put in the more useful and transparent form given by 
Theorem A1.3. For a filter T in a space {X,T) 

adh(^) = Pi C1(F) (103) 
and dually adh(x), are closed sets. □ 

Proof. Follows immediately from the definitions for the closure of a set Eq. ()2U() and the 
adherence of a filter Eq. (|96|) . As always, it is a matter of convenience in using the basic filters 
pB instead of to generate the adherence set. ■ 

It is infact true that the limit sets lim(jr) and lim(x) are also closed set of X; the arguments 
involving ultrafilters are omitted. 

Similar to the notion of the adherence set of a filter is its core — a concept that unlike 
the adherence, is purely set-theoretic being the infimum of the filter and is not linked with any 
topological structure of the underlying (infinite) set X — defined as 

core (T) = f] F. (104) 

From Theorem Al.3 and the fact that the closure of a set A is the smallest closed set that 
contains A, see Eq. (|'25() at the end of Tutorial4, it is clear that in terms of filters 

A = core(F7^(^)) 
Cl{A) = adh(FP(A)) (105) 
= core(Cl(F7^(^))) 

where p'P{A) is the principal filter at A; thus the core and adherence sets of the principal filter at 
A are equal respectively to A and Cl{A) — a classic example of equality in the general relation 
CI (Pi Aa) ^ n Cl{Aa) — but both are empty, for example, in the case of an infinitely decreasing 
family of rationals centered at any irrational (leading to a principal filter-base of rationals at the 
chosen irrational). This is an important example demonstrating that the infinite intersection 
of a non-empty family of (closed) sets with the finite intersection property may be empty, a 
situation that cannot arise on a finite set or an infinite compact set. Filters on X with an 
empty core are said to be free, and are fixed otherwise: notice that by its very definition filters 
cannot be free on a finite set, and a free filter represents an additional feature that may arise in 
passing from finite to infinite sets. Clearly (adh(.F) = 0) (core(^) = 0), but as the important 
example of the rational space in the reals illustrate, the converse need not be true. Another 
example of a free filter of the same type is provided by the filter-base {[a, oo) : a e M} in M. Both 
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these examples illustrate the important property that a filter is free iff it contains the cofinite 
filter, and the cofinite filter is the smallest possible free filter on an infinite set. The free cofinite 
filter, as these examples illustrate, may be typically generated as follows. Let ^ be a subset 
of X, X € Yidyx-Ai-^)-! ^-^^ consider the directed set Eq. to generate the corresponding 
net in A given by x(A^ € Mx,t) = t £ A. Quite clearly, the core of any Prechet filter based on 
this net must be empty as the point x does not lie in A. In general, the intersection is empty 
because if it were not so then the complement of the intersection — which is an element of the 
filter — would be infinite in contravention of the hypothesis that the filter is Frechet. It should 
be clear that every filter finer than a free filter is also free, and any filter coarser than a fixed 
filter is fixed. 

Nets and filters are complimentary concepts and one may switch from one to the other as 
follows. 

Definition Al.lO. Let T he a filter on X and let = {{F,x) : {F G J^){x G F)} be a 
directed set with its natural direction {F,x) ^ {G,y) (G C F). The net XT '■ oFx — > X 
defined by 

xAF, x) =x 

is said to be associated with the filter T , see Eq. □ 

Definition Al.ll. Let x : D ^ X be a net and = {/3 G D : /? ^ q G B} a residual in B. 
Then 

pB^ "^4^ {x{Ra) ■■ Res(B) ^ X for all a G B} 

is the filter-base associated with Xj ^-^s corresponding filter J^^ obtained by taking all 
supersets of the elements of pBy. is the filter associated with x- D 

yB^ is a filter-base in X because xCfl ^a) ^ fl x(I^q:)) that holds for any functional relation, 
proves (FB2). It is not difficult to verify that 

(i) X is eventually in A =^ A G J-^^ and 

(ii) X is frequently in ^ ^ (VM„ G Res(B))(A fl x{^a) / 0) ^ ^ fl -^x / *3 • 

Limits and adherences are obviously preserved in switching between nets (respectively, filters) 
and the filters (respectively, nets) that they generate: 

lim(x) = lim(.F^), adh(x) = adh(.;^^) (106) 
lim(J^) = lim(x^), adh(J^) = adh(x^). (107) 

The proofs of the two parts of Eq. ()1U6|) . for example, go respectively as follows, x G lim(x) <^=> 
X is eventually in Nx <^ (VA^ G Mx)i^F G JF^) such that (F C iV) <^ x G hm(JP"^), and x G 
adh(x) ^ X is frequently in Mx (ViV G A/;)(VF e J^^) {N f] F ^ 0) ^ x e adh{J^^); here F 
is a superset of x(Ro)- 

Some examples of convergence of filters are 

(1) Any filter on an indiscrete space X converges to every point of X. 

(2) Any filter on a space that coincides with its topology (minus the empty set, of course) 
converges to every point of the space. 

(3) For each x £ X, the neighbourhood filter Mx converges to x; this is the smallest filter 
on X that converges to x. 

(4) The indiscrete filter = {X} converges to no point in the space (A, {0, A,X — A, A}), 
but converges to every point of A — yl if A has the topology {0, A, A} because the only neigh- 
bourhood of any point in A — A is A which is contained in the filter. 

One of the most significant consequences of convergence theory of sequences and nets, as 
shown by the two theorems and the corollary following, is that this can be used to describe the 
topology of a set. The proofs of the theorems also illustrate the close inter-relationship between 
nets and filters. 
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Theorem A1.4. For a subset A of a topological space X, 

Cl{A) ={x £ X: {3 anet xin A) (x^ x)}. □ (108) 

Proof. Necessity. For x G 01(^4), construct a net x — > 2; in ^ as follows. Let Bx be a topological 
local base at x, which by definition is the collection of all open sets of X containing x. For each 
/? G D, the sets 

Nf3= f^{B^:B^eBx} 

form a nested decreasing local neighbourhood filter base at x. With respect to the directed set 
nNp = {{Nfj, I5): {(3 £^){xis ^ Np)} oi Eq. (jHU), define the desired net in A by 

x{Np,fi) = xp^Np^A 

where the family of nonempty decreasing subsets Np P| ^4 of X constitute the filter-base in A as 
required by the directed set j^Np. It now follows from Eq. (|83|) and the arguments in Example 
Al.3(3) that xp — > x; compare the directed set of Eq. (|93j) for a more compact, yet essentially 
identical, argument. Carefully observe the dual roles oiNx as a neighbourhood filter base at x. 

Sufficiency. Let x be a net in A that converges to x S X. For any A^^ G -Mc, there is a 
Mq, G Res(][l)) of Eq. (jHSJ such that x(I^a) ^ ^a- Hence the point x(a) = Xa oi A belongs to 
Na so that Afl / which means, from Eq. ((201), that x £ Cl{A). ■ 

Corollary. Together with Eqs. and ((22j, is follows that 

Der(^) = {x G X : (3 a net C in A - {x}){C ^ x)} □ (109) 

The fiher forms of Eqs. (TRISll and (flTi9|l 

Cl(^) = {xGX: (3amterJ^onX)(^GJ^)(J^^x)} (110) 
Der(^) = {x G X: (3 afilter J^on X)(^- {x} G J^)(J^^x)} 

then follows from Eq. (j^TJ and the finite intersection property (F2) of so that every neigh- 
bourhood of X must intersect A (respectively A — {x}) in Eq. (|ll()j) to produce the converging 
net needed in the proof of Theorem A1.3. 

We end this discussion of convergence in topological spaces with a proof of the following 
theorem which demonstrates the relationship that "eventually in" and "frequently in" bears with 
each other; Eq. (|111|1 below is the net-counterpart of the filter equation (|lflO|) . 

Theorem A1.5. If x is a net in a topological space X, then x G adh(x) iff some subnet 
C{P) = x(<7(/3)) of x('^)> with, a G D and /3 G E , converges in X to x; thus 

adh(x) = {x G A: (3 a subnet C ^ X in A)(C ^ x)}. □ (111) 

Proof. Necessity. Let x G adh(x). Define a subnet function a : ^ B by a{Na,a) = a 

where is the directed set of Eq. (SNl) and (SN2) are quite evidently satisfied 

according to Eq. (|83|) . Proceeding as in the proof of the preceding theorem it follows that 
Xf3 = xi^i^a,Oi)) = C(Aa,a) — > X is the required converging subnet that exists from Eq. (|H7|) 
and the fact that x(I^o) fl / for every Na G Mx, by hypothesis. 

Sufficiency. Assume now that x has a subnet ({Na,a) that converges to x. If x does not 
adhere at x, there is a neighbourhood Na of x not frequented by it, in which case x must be 
eventually in X — Na. Then C(Aq, a) is also eventually in X — Na so that cannot be eventually 
in Na, a contradiction of the hypothesis that (^[Na,a) — > x.^^ ■ 

^^In a first countable space, while the corresponding proof of the first part of the theorem for sequences 
is essentially the same as in the present case, the more direct proof of the converse illustrates how the 
convenience of nets and directed sets may require more general arguments. Thus if a sequence (a;i)igN 
has a subsequence (a;,;j.)fcgN converging to x, then a more direct line of reasoning proceeds as follows. 
Since the subsequence converges to x, its tail {xi^)k>i must be in every neighbourhood N of x. But as 
the number of such terms is infinite whereas {ik : k < j} is only finite, it is necessary that for any given 
n gN, cofinitely many elements of the sequence ixi^)if.>n be in N. Hence x G adh((xi)igN)- 
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Eqs. H1U8() and (jlllj) imply that the closure of a subset A of X is the class of X-adherences 
of all the (sub)nets of X that are eventually in A. This includes both the constant nets yielding 
the isolated points of A and the non-constant nets leading to the cluster points of A, and implies 
the following physically useful relationship between convergence and topology that can be used 
as defining criteria for open and closed sets having a more appealing physical significance than 
the original definitions of these terms. Clearly, the term "net" is justifiably used here to include 
the subnets too. 

The following corollary of Theorem Al.5 summarizes the basic topological properties of sets 
in terms of nets (respectively, filters). 

Corollary. Let ^4 be a subset of a topological space X. Then 

(1) A is closed in X iff every convergent net of X that is eventually in A actually converges 
to a point in A (respectively, iff the adhering points of each filter-base on A all belong to A). 
Thus no X-convergent net in a closed subset may converge to a point outside it. 

(2) A is open in X iff every convergent net of X that converges to a point in A is eventually 
in A. Thus no X-convergent net outside an open subset may converge to a point in the set. 

(3) A is closed- and-op en (clopen) in X iff every convergent net of X that converges in A is 
eventually in A and conversely. 

(4) X £ Der(^) iff some net (respectively, filter-base) in ^ — {x} converges to x; this clearly 
eliminates the isolated points of A and x G Cl(^) iff some net (respectively, filter-base) in A 
converges to x. □ 

Remark. The differences in these characterizations should be fully appreciated: If we consider 
the cluster points Der(y4) of a net x in ^4 as the resource generated by x, then a closed subset 
of X can be considered to be selfish as its keeps all its resource to itself: Der{A) riA = Der(^). 
The opposite of this is a donor set that donates all its generated resources to its neighbour: 
Der (^) r\X-A = Der{A) , while for a neutral set, both Der (A) n A / and Der (A) n X - yl / 
implying that the convergence resources generated in A and X — A can be deposited only in 
the respective sets. The clopen sets (see diagram 2-2 of Fig. I22() are of some special interest as 
they are boundary less so that no net-resources can be generated in this case as any such limit 
are required to be simultaneously in the set and its complement. 

Example Al.l, Continued. This continuation of Example Al.2 illustrates how sequential 
convergence is inadequate in spaces that are not first countable like the uncountable set with 
cocountable topology. In this topology, a sequence can converge to a point x in the space iff it 
has only a finite number of distinct terms, and is therefore eventually constant. Indeed, let the 
complement 

G=^X-F, F = {xi-. Xi^ X, i eN} 

of the countably closed sequential set F be an open neighbourhood of x G X. Because a 
sequence (xj)jgN in X converges to a point x G X iff it is eventually in every neighbourhood 
(including G) of x, the sequence represented by the set F cannot converge to x unless it is of 
the uncountable type^*^ 

(Xo, Xi, • • • , X/, X7+1, X/+1, • • • ) (112) 

with only a finite number / of distinct terms actually belonging to the closed sequential set 
F = X — G, and x/+i = x. Note that as we are concerned only with the eventual behaviour 
of the sequence, we may discard all distinct terms from G by considering them to be in F, and 
retain only the constant sequence (x, x, • • • ) in G. In comparison with the cofinite case that was 
considered in Sec. 4, the entire countably infinite sequence can now lie outside a neighbourhood 
of X thereby enforcing the eventual constancy of the sequence. This leads to a generalization of 
our earlier cofinite result in the sense that a cocountable filter on a cocountable space converges 
to every point in the space. 

^"This is uncountable because interchanging any two eventual terms of the sequence does not alter the 
sequence. 
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It is now straightforward to verify that for a point xq in an uncountable cocountable space 

X 

(a) Even though no sequence in the open set G = X—{xq} can converge to xq, yet xq € C1(G) 
since the intersection of any (uncountable) open neighbourhood U of xq with G, being an 
uncountable set, is not empty. 

(b) By corollary (1) of Theorem Al.5, the uncountable open set G = X — {xq} is also closed 
in X because if any sequence (xi, X2, • • • ) in G converges to some x £ X, then x must be in G as 
the sequence must be eventually constant in order for it to converge. But this is a contradiction 
as G cannot be closed since it is not countable. By the same reckoning, although {xq} is not 
an open set because its complement is not countable, nevertheless it follows from Eq. (|112|) 
that should any sequence converge to the only point xq of this set, then it must eventually be 
in {xo} so by corollary (2) of the same theorem, {xq} becomes an open set. 

(c) The identity map 1 : X — > X^, where is X with discrete topology, is not continuous 
because the inverse image of any singleton of is not open in X. Yet if a sequence converges 
in X to X, then its image (l(x)) = (x) must actually converge to x in X^ because a sequence 
converges in a discrete space, as in the cofinite or cocountable spaces, iff it is eventually constant; 
this is so because each element of a discrete space being clopen is boundaryless. 

This pathological behaviour of sequences in a non Hausdorff, non first countable space does 
not arise if the discrete indexing set of sequences is replaced by a continuous, uncountable 
directed set like M for example, leading to nets in place of sequences. In this case the net can 
be in an open set without having to be constant-valued in order to converge to a point in it as 
the open set can be defined as the complement of a closed countable part of the uncountable 
net. The careful reader could not have failed to notice that the burden of the above arguments, 
as also of that in the example following Theorem 4.6, is to formalize the fact that since a closed 
set is already defined as a countable (respectively finite) set, the closure operation cannot add 
further points to it from its complement, and any sequence that converges in an open set in these 
topologies must necessarily be eventually constant at its point of convergence, a restriction that 
no longer applies to a net. The cocountable topology thus has the very interesting property of 
filtering out a countable part from an uncountable set, as for example the rationals in M. ■ 

This example serves to illustrate the hard truth that in a space that is not first countable, 
the simplicity of sequences is not enough to describe its topological character, and in fact 
"sequential convergence will be able to describe only those topologies in which the number 
of (basic) neighbourhoods around each point is no greater than the number of terms in the 
sequences , IWillardI ()l97nl V It is important to appreciate the significance of this interplay of 
convergence of sequences and nets (and of continuity of functions of Appendix Al) and the 
topology of the underlying spaces. 

A comparison of the defining properties (Tl), (T2), (T3) of topology T with (Fl), (F2), (F3) 
of that of the filter !F, shows that a filter is very close to a topology with the main difference 
being with regard to the empty set which must always be in T but never in J^. Addition of the 
empty set to a filter yields a topology, but removal of the empty set from a topology need not 
produce the corresponding filter as the topology may contain nonintersecting sets. 

The distinction between the topological and filter-bases should be carefully noted. Thus 

(a) While the topological base may contain the empty set, a filter-base cannot. 

(b) From a given topology, form a common base by dropping all basic open sets that do not 
intersect. Then a (coarser) topology can be generated from this base by taking all unions, and 
a filter by taking all supersets according to Eq. H1U2|) . For any given filter this expression may 
be used to extract a subclass pB base for J^. 



Note that {x} is a 1-point set but (x) is an uncountable sequence. 
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A2. Initial and Final topology 



The commutative diagram of Fig. [^contains four sub-diagrams X — X^ — f{X), Y — X-q — f{X), 
X — X^ — Y and X — f{X) — Y. Of these, the first two are especially significant as they can 
be used to conveniently define the topologies on X^ and f{X) from those of X and Y, so that 
/b> /b^ and G have some desirable continuity properties; we recall that a function f : X ^ Y is 
continuous if inverse images of open sets of Y are open in X. This simple notion of continuity 
needs refinement in order that topologies on X^ and f{X) be unambiguously defined from 
those of X and Y, a requirement that leads to the concepts of the so-called final and initial 
topologies. To appreciate the significance of these new constructs, note that if / : {X,U) — > {Y, V) 
is a continuous function, there may be open sets in X that are not inverse images of open — 
or for that matter of any — subset of Y, just as it is possible for non-open subsets of Y to 
contribute to U. When the triple {U, f, V} are tuned in such a manner that these are impossible, 
the topologies so generated on X and Y are the initial and final topologies respectively; they 
are the smallest (coarsest) and largest (finest) topologies on X and Y that make f : X ^ Y 
continuous. It should be clear that every image and preimage continuous function is continuous, 
but the converse is not true. 

Let sat(C/) := f~ f{U) C X be the saturation of an open set U oi X and comp(l/) := 
ff~{V) = Vf] f{X) G y be the component of an open set F of 1" on the range f{X) of /. Let 
^sat, Vcomp denote respectively the saturations f/gat = {sat(C/) : U £ U} oi the open sets of X 
and the components Vcomp = {comp(l/) : V £ V} of the open sets of Y whenever these are also 
open in X and Y respectively. Plainly, Usa.t ^ U and Vcomp ^ V. 

Definition A2.1. For a function e: X ^ (Y,V), the preimage or initial topology of X based 
on (generated by) e and V is 

lT{e;V} = {UCX:U = e~{V) if F G Vcomp}, (113) 
while for q: {X,U) — > Y, the image or final topology of Y based on (generated by) lA and q is 

FT{U;q} = {VOY:q-iV) = UifU£U,^t}- □ (114) 

Thus, the topology of {X, IT{e; V}) consists of, and only of, the e-saturations of all the open sets 
of e(X), while the open sets of {Y,FT{U; q}) are the g-images in Y (and not just in q(^X)) of 
all the g-saturated open sets of X.^"^ The need for defining in terms of Vcomp rather than 

V will become clear in the following. The subspace topology lT{i;l/{} of a subset A C {X,U) 
is a basic example of the initial topology by the inclusion map i : X ^ A ^ (^X,U), and we 
take its generalization e : (A, IT{e; V}) (Y, V) that embeds a subset A oi X into Y as the 
prototype of a preimage continuous map. Clearly the topology of Y may also contain open sets 
not in e{X), and any subset in y — e(X) may be added to the topology of Y without altering 
the preimage topology of X: open sets of Y not in e{X) may be neglected in obtaining the 
preimage topology as e~(Y — e{X)) = 0. The final topology on a quotient set by the quotient 
map Q : {X,U) — > X/ ~, which is just the collection of Q-images of the Q-saturated open sets 
of X, known as the quotient topology of X/ ~, is the basic example of the image topology and 
the resulting space {X/ ~,FT{Z//;Q}) is called the quotient space. We take the generalization 
q: [X^li) (Y,FT{U;q}) of Q as the prototype of a image continuous function. 

The following results are specifically useful in dealing with initial and final topologies; com- 
pare the corresponding results for open maps given later. 

Theorem A2.1. Let {X^IA) and (Yi, Vi) be topological spaces and let Xi be a set. If f: Xi — > 
(yi,Vi), q: {X,U) — > Xi, and h = f o q: (X,U) (yi,Vi) are functions with the topology Ui 
of Xi given by FT{l/( ; q} , then 

(a) / is continuous iff h is continuous. 

(b) / is image continuous iff Vi = FT{^; h}. □ 
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Figure 21: Continuity in final and initial topologies. 



Theorem A2.2. Let (Y,V) and (Xi,Ui) be topological spaces and let Yi he a set. If f : 
{Xi,hii) Yi, e: Yi ^ {Y, V) and g = e o f : {Xi,Ui) (Y, V) are function with the topology 
Vi ofYi given by IT{e; V}, then 

(a) / is continuous iff g is continuous. 

(b) / is preimage continuous iff Ui = ITj^r; V}. □ 

As we need the second part of these theorems in our applications, their proofs are indicated 
below. The special significance of the first parts is that they ensure the converse of the usual 
result that the composition of two continuous functions is continuous, namely that one of the 
components of a composition is continuous whenever the composition is so. 

Proof of Theorem A2.1. If / be image continuous, Vi = {Vi C Yi : f~{Vi) G Ui] and 
lAi = {Ui C Xi : q~{Ui) £ U} are the final topologies of Yi and Xi based on the topologies of 
Xi and X respectively. Then Vi = {Fi C : q~ f~{Vi) £U} shows that h is image continuous. 

Conversely, when h is image continuous, Vi = {Vi C Yi : h~{Vi)} G U} = {Vi C Yi : 
q~f~(yi)} S U}, with Ui = {Ui C Xi : q~iUi) G U}, proves f~{Vi) to be open in Xi and 
thereby / to be image continuous. 

Proof of Theorem A2.2. If / be preimage continuous, Vi = {Vi ^Yi:Vi = e~{V) if F G V} 

and Ui = {Ui C Xi: Ui = f~{Vi) if Vi G Vi} are the initial topologies of Yi and Xi respectively. 
Hence from Ui = {Ui Xi : Ui = f~e~{V) ii V £ V} it follows that g is preimage continuous. 

Conversely, when g is preimage continuous, Ui = {Ui Q Xi : Ui = g~{V) if G V } = 
{Ui QXi:Ui = f-e-{V) if V G V} and Vi = {^i ^Yi:Vi = e-{V) if ^ G V} shows that / is 
preimage continuous. ■ 

Since both Eqs. H113() and (|114() are in terms of inverse images (the first of which constitutes 
a direct, and the second an inverse, problem) the image f{U) = comp(y) for F G V is of interest 
as it indicates the relationship of the openness of / with its continuity. This, and other related 
concepts are examined below, where the range space f{X) is always taken to be a subspace of 
Y. Openness of a function / : {X,U) — > {Y,V) is the "inverse" of continuity, when images of 
open sets of X are required to be open in Y; such a function is said to be open. Following are 
two of the important properties of open functions. 

(1) Iff: {X,U) {YJ{K)) is an open function, thensoisf<: {XM) ^ {f{X),lT{i-f{U)]). 
The converse is true if f{X) is an open set ofY; thus openness of /< : {X,U) — > {f{X), /<(W)) 
implies that of f : {X,U) — > {Y,V) whenever f{X) is open in Y such that f<{U) G V for U £U. 

^^We adopt the convention of denoting arbitrary preimage and image continuous functions by e and q 
respectively even though they are not be injectivc or surjective; recall that the embedding e: X D A ^ Y 
and the association q: X ^ f{X) are 1 : 1 and onto respectively. 
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The truth of this last assertion follows easily from the fact that if f<:{U) is an open set of 
f{X) C Y, then necessarily f<:{U) = Vf^f{X) for some V £ V, and the intersection of two 
open sets of Y is again an open set of Y. 

(2) Iff: {X,U) (y,V) and 5: {Y,V) {Z,W) are open functions then gof:{X,U)^ 
{Z,W) is also open. It follows that the condition in (1) on f{X) can be replaced by the 
requirement that the inclusion i: {f(X),lT{i;V}) iY,V) be an open map. This interchange 
of f{X) with its inclusion i: f{X) Y into y is a basic result that finds application in many 
situations. 

Collected below are some useful properties of the initial and final topologies that we need 
in this work. 

Initial Topology. In Fig. [21T b). consider Yi = h{Xi), e — > i and / — > /i< : Xi — > 
(^(Xi),IT{i;V}). From = h'{Bf]h{Xi)) for any B CY, it follows that for an open 

set V of Y, /i~(Vcomp) = h~{V) is an open set of Xi which, if the topology of Xi is IT{/i; V}, 
are the only open sets of Xi. Because Fcomp is an open set of h{Xi) in its subspace topology, 
this implies that the preimage topologies lT{h;V} and IT{/i<; IT{i; V}} of Xi generated by h 
and /i< are the same. Thus the preimage topology of Xi is not affected if Y is replaced by the 
subspace h{Xi), the part Y — h{Xi) contributing nothing to IT{/i; V}. 

A preimage continuous function e : X — > (Y, V) is not necessarily an open function. Indeed, 
if [/ = e~{V) £ IT{e; V}, it is almost trivial to verify along the lines of the restriction of open 
maps to its range, that e{U) = ee~{V) = e{X) f]V , V £ V, is open in Y (implying that e is an 
open map) iff e{X) is an open subset of Y (because finite intersections of open sets are open). 
A special case of this is the important consequence that the restriction e< : (X, IT{e;V}) 
(e(X),IT{i; V}) of e : {X,YT{h;V}) {Y,V) to its range is an open map. Even though a 
preimage continuous map need not be open, it is true that an injective, continuous and open 
map / : X — > (Y, V) is preimage continuous. Indeed, from its injectivity and continuity, inverse 
images of all open subsets of Y are saturated-open in X, and openness of / ensures that these 
are the only open sets of X the condition of injectivity being required to exclude non-saturated 
sets from the preimage topology. It is therefore possible to rewrite Eq. as 

U G IT{e; V} ^ e{U) = V if V £ Vcomp, (115) 

and to compare it with the following criterion for an injective, open- continuous map f : {X, U) 
(y, V) that necessarily satisfies sat (A) = A for all A C X 

U£U^ {{{f{U)}ueu = Vcomp) A {f~HV)\vev G U). (116) 

Final Topology. Since it is necessarily produced on the range TZ{q) of q, the final topology is 
often considered in terms of a surjection. This however is not necessary as, much in the spirit 
of the initial topology, Y — q{X) 7^ inherits the discrete topology without altering anything, 
thereby allowing condition ()114|1 to be restated in the following more transparent form 

V £ FT{U; q}^V = q{U) iiU £ Z^sat, (117) 

and to compare it with the following criterion for a surjective, open- continuous map f : {X,U) — > 
(y, V) that necessarily satisfies fB = B for all S C y 

y G V ^ (Z^sat = {f-{V)}vev) A {f{U)\ueU G V). (118) 

As may be anticipated from Fig. 1211 the final topology does not behave as well for subspaces as 
the initial topology does. This is so because in Fig. I2ir a) the two image continuous functions h 
and q are connected by a preimage continuous inclusion /, whereas in Fig. I2ir b) all the three 
functions are preimage continuous. Thus quite like open functions, although image continuity 
of h: {XM) {Yi,¥T{U;h}) implies that of /i< : {X,U) ^ {h{X),lT{i;YT{U-h})) for a 
subspace h{X) of Yi, the converse need not be true unless — entirely like open functions again 
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— either h{X) is an open set of Yi ov i: {h{X), YT{i; FT {U ; h})) {X,FT{l(;h}) is an open 
map. Since an open preimage continuous map is image continuous, this makes i : h{X) Yi 
an ininal function and hence all the three legs of the commutative diagram image continuous. 

Like preimage continuity, an image continuous function q : {X,IA) — > Y need not he open. 
However, although the restriction of an image continuous function to the saturated open sets 
of its domain is an open function, q is unrestrictedly open iff the saturation of every open set 
of X is also open in X. Infact it can be verified without much effort that a continuous, open 
surjection is image continuous. 

Combining Eqs. H115() and H117() gives the following criterion for ininality 

[/ and y G IFT{^/sat; /; V} ^ {{f{U)}ueU.^, = V)(Z^sat = {f~{V)}vev), (119) 

which reduces to the following for a homeomorphism / that satisfies both sat (A) = A for A Q X 
and fB = B ioi B <^Y 

f/ and y G HOM{^/; /; V} ^ {U = {f~\V)}v^v){{f{U)}um = V) (120) 
and compares with 

[/ and y G OC{U;f-V} ^ (sat(C/) ^U: {f{U)}u&i = Vcomp)A 

A (comp(y) G V: {f-{V)}v^v = Z^sat) (121) 

for an open-continuous /. 

The following is a slightly more general form of the restriction on the inclusion that is needed 
for image continuity to behave well for subspaces of Y . 

Theorem A2.3. Let q : {X,U) {Y,FT{U; q}) be an image continuous function. For a 
subspace B of {Y, FT{U; q}), 

FT{lT{j;U};q<}=lT{i;FT{U;q}} 

where g< : {q~ {B),YT{j;l{}) (i?, FT{IT{j;^}; g<}), if either q is an open map or B is an 
open set of Y. □ 

In summary we have the useful result that an open preimage continuous function is image 
continuous and an open image continuous function is preimage continuous, where the second 
assertion follows on neglecting non-saturated open sets in X; this is permitted in as far as the 
generation of the final topology is concerned, as these sets produce the same images as their 
saturations. Hence an image continuous function q: X ^ Y is preimage continuous iff every 
open set in X is saturated with respect to q, and a preimage continuous function e: X ^ Y is 
image continuous iff the e-image of every open set of X is open in Y . 



A3. More on Topological Spaces 

This Appendix — which completes the review of those concepts of topological spaces begun in 
Tutorial4 that are needed for a proper understanding of this work — begins with the following 
summary of the different possibilities in the distribution of Der(A) and Bdy(A) between sets 
A <Z X and its complement X — A, and follows it up with a few other important topological 
concepts that have been used, explicitly or otherwise, in this work. 

Definition A3.1. Separation, Connected Space. A separation (disconnection) of X is a 
pair of mutually disjoint nonempty open (and therefore closed) subsets Hi and H2 such that 
X = Hi U H2. A space X is said to be connected if it has no separation, that is if it cannot be 
partitioned into two open or two closed nonempty subsets. X is separated (disconnected) if it 
is not connected. □ 
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Der(^) I I Bdyx-A(^) ■ ; Der(X - A) \ ; BdyA(X - .4) 



Figure 22: Classification of a subset A oi X relative to the topology of X. The derived set of A may 
intersect both A and X — A (row 3), may be entirely in A (row 2), or may be wholly \j\ X — A (row 1). 
A is closed iff Bdy(A) := Bdy^_^(A) U Bdy^(X - A) ^ (C1(A) n (X - A)) U (C1(X - A) n A) C A (row 
2), open iff Bdy(A) C X — A (column 2), and clopen iff Bdy(A) = when the derived sets of both A 
and X — A are contained in the respective sets. An open set, beside being closed, may also be neutral 
or donor. 
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It follows from the definition, that for a disconnected space X the following are equivalent 
statements. 

(a) There exist a pair of disjoint nonempty open subsets of X that cover X. 

(b) There exist a pair of disjoint nonempty closed subsets of X that cover X. 

(c) There exist a pair of disjoint nonempty clopen subsets of X that cover X. 

(d) There exists a nonempty, proper, clopen subset of X. 

By a connected subset is meant a subset of X that is connected when provided with its relative 
topology making it a subspace of X. Thus any connected subset of a topological space must 
necessarily be contained in any clopen set that might intersect it: if C and H are respectively 
connected and clopen subsets of X such that C f]H / 0, then C C H because Cf]H is a 
nonempty clopen set in C which must contain C because C is connected. 

For testing whether a subset of a topological space is connected, the following relativized 
form of (a) — (d) is often useful. 

Lemma A3.1. A subset A of X is disconnected iff there are disjoint open sets U and V of X 
satisfying 

Uf]A^(l>^Vf]AsuchtliatACU\jV, with U f]V f] A = (ll (122) 
or tliere are disjoint closed sets E and F of X satisfying 

Ef]A^ / FflA such that ACE\JF, with Ef]Ff]A = 0. (123) 

Thus A is disconnected iff there are disjoint clopen subsets in the relative topology of A that 
cover A. □ 

Lemma A3. 2. If A is a suhspace of X, a separation of A is a pair of disjoint nonempty subsets 
Hi and H2 of A whose union is A neither of which contains a cluster point of the other. A is 
connected iff there is no separation of A. □ 

Proof. Let Hi and H2 be a separation of A so that they are clopen subsets of A whose union 
is A. As Hi is a closed subset of A it follows that Hi = C\x{Hi) P\A, where C\x{Hi)f]A is 
the closure of Hi in A; hence Clx(-ffi)n-^2 = 0- But as the closure of a subset is the union 
of the set and its adherents, an empty intersection signifies that H2 cannot contain any of the 
cluster points of Hi. A similar argument shows that Hi does not contain any adherent of H2- 

Conversely suppose that neither Hi nor H2 contain an adherent of the other: Clx{Hi) f] H2 = 
and Clx{H2) f]Hi = $. Hence ClxiHi) p\A = Hi and Clx{H2) r\A = H2S0 that both Hi 
and H2 are closed in A. But since Hi = A — H2 and H2 = A — Hi, they must also be open in 
the relative topology of A. ■ 

Following are some useful properties of connected spaces. 

(cl) The closure of any connected subspace of a space is connected. In general, every B 
satisfying 

A C S C C\{A) 

is connected. Thus any subset of X formed from A by adjoining to it some or all of its adherents 
is connected so that a topological space with a dense connected subset is connected. 

(c2) The union of any class of connected subspaces of X with nonempty intersection is a 
connected subspace of X. 

(c3) A topological space is connected iff there is a covering of the space consisting of con- 
nected sets with nonempty intersection. Connectedness is a topological property: Any space 
homeomorphic to a connected space is itself connected. 

(c4) If Hi and H2 is a separation of X and A is any connected subset A of X, then either 
AC Hi or AC H2. 

While the real line M is connected, a subspace of M is connected iff it is an interval in M. 
The important concept of total disconnectedness introduced below needs the following 
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Definition A3. 2. Component. A component C* of a space X is a maximally (with respect 
to inclusion) connected subset of X. □ 



Thus a component is a connected subspace which is not properly contained in any larger con- 
nected subspace of X. The maximal element need not be unique as there can be more than 
one component of a given space and a "maximal" criterion rather than "maximum" is used as 
the component need not contain every connected subsets of X; it simply must not be contained 
in any other connected subset of X. Components can be constructively defined as follows: Let 
X € X he any point. Consider the collection of all connected subsets of X to which x belongs 
Since {x} is one such set, the collection is nonempty. As the intersection of the collection is 
nonempty, its union is a nonempty connected set C. This the largest connected set containing 
X and is therefore a component containing x and we have 

(CI) Let X £ X. The unique component of X containing x is the union of all the connected 
subsets of X that contain x. Conversely any nonempty connected subset A of A is contained 
in that unique component of A to which each of the points of A belong. Hence a topological 
space is connected iff it is the unique component of itself. 

(C2) Each component C* of A is a closed set of A: By property (cl) above, C1(C*) is also 
connected and from C* C C1(C*) it follows that C* = C1(C*). Components need not be open 
sets of A: an example of this is the space of rationals Q in reals in which the components are 
the individual points which cannot be open in M; see Example (2) below. 

(C3) Components of A are equivalence classes of (A, ~) with x ~ y iff they are in the same 
component: while reflexivity and symmetry are obvious enough, transitivity follows because if 
x,y G Cl and y,z G C2 with Ci, C2 connected subsets of A, then x and z are in the set Ci \J C2 
which is connected by property c(2) above as they have the point y in common. Components 
are connected disjoint subsets of A whose union is A (that is they form a partition of A with 
each point of A contained in exactly one component of A) such that any connected subset of 
A can be contained in only one of them. Because a connected subspace cannot contain in it 
any clopen subset of A, it follows that every clopen connected subspace must be a component of 



Even when a space is disconnected, it is always possible to decompose it into pairwise disjoint 
connected subsets. If A is a discrete space this is the only way in which A may be decomposed 
into connected pieces. If A is not discrete, there may be other ways of doing this. For example, 
the space 



Intuition tells us that only in the third of these decompositions have we really broken up A 
into its connected pieces. What distinguishes the third from the other two is that neither of the 
pieces [0, 1] or (2, 3) can be enlarged into bigger connected subsets of A. 

As connected spaces, the empty set and the singleton are considered to be degenerate and 
any connected subspace with more than one point is nondegenerate. At the opposite extreme of 
the largest possible component of a space A which is A itself, are the singletons {x} for every 
X € X. This leads to the extremely important notion of a 

Definition A3. 3. Totally disconnected space. A space X is totally disconnected if every 
pair of distinct points in it can be separated by a disconnection of X. □ 

A is totally disconnected iff the components in A are single points with the only nonempty 
connected subsets of A being the one-point sets: Uxy^y^ACX are distinct points of a 



A. 



A = {x e M: (0 < X < 1) V (2 < .X < 3)} 

has the following three distinct decomposition into connected subsets: 

A = [0,l/2)U[l/2,l]U(2,7/3]U(7/3,3) 



A 




A = 



[0,1]U(2,3). 
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subset AoiX then A = {Af\ Hi) [j{A f| H2), where X = Hi[jH2 with x e Hi and y € H2 is a 
disconnection of X (it is possible to choose Hi and H2 in this manner because X is assumed to 
be totally disconnected), is a separation of A that demonstrates that any subspace of a totally 
disconnected space with more than one point is disconnected. 

A totally disconnected space has interesting physically appealing separation properties in 
terms of the (separated) Hausdorff spaces; here a topological space X is Hausdorjf, or T2, iff 
each two distinct points of X can be separated by disjoint neighbourhoods, so that for every 
X ^ y & X , there are neighbourhoods M £ Mx and G My such that M f]N = 0. This means 
that for any two distinct points x ^ y £ X , it is impossible to find points that are arbitrarily 
close to both of them. Among the properties of Hausdorff spaces, the following need to be 
mentioned. 

(HI) X is Hausdorff iff for each x € X and any point y ^ x, there is a neighbourhood N 
of X such that y Cl(A^). This leads to the significant result that for any x £ X the closed 
singleton 

{x}= fl C\{N) 

is the intersection of the closures of any local base at that point, which in the language of nets 
and filters (Appendix Al) means that a net in a Hausdorff space cannot converge to more than 
one point in the space and the adherent set adh(A4) of the neighbourhood filter at x is the 
singleton {x}. 

(H2) Since each singleton is a closed set, each finite set in a Hausdorff space is also closed 
in X. Unlike a cofinite space, however, there can clearly be infinite closed sets in a Hausdorff 
space. 

(H3) Any point x in a Hausdorff space X is a cluster point of A C X iff every neighbourhood 
of X contains infinitely many points of A, a fact that has led to our mental conditioning of the 
points of a (Cauchy) sequence piling up in neighbourhoods of the limit. Thus suppose for 
the sake of argument that although some neighbourhood of x contains only a finite number 
of points, X is nonetheless a cluster point of A. Then there is an open neighbourhood U of 
X such that U f]{A — {x}) = {xi,--- is a finite closed set of X not containing x, and 

U f^{X — {xi, ■ ■ ■ ,Xn}) being the intersection of two open sets, is an open neighbourhood of 
X not intersecting A — {x} implying thereby that x Der(A); infact U f]{X — {xi, ■ ■ ■ ,Xn}) 
is simply {x} if x G ^ or belongs to Jidyx-Ai^) when x G X — A. Conversely if every 
neighbourhood of a point of X intersects A in infinitely many points, that point must belong 
to Der(^) by definition. 

Weaker separation axioms than Hausdorffness are those of Tq, respectively Ti, spaces in 
which for every pair of distinct points at least one, respectively each one, has some neighbour- 
hood not containing the other; the following table is a listing of the separation properties of 
some useful spaces. 

It should be noted that that as none of the properties (H1)-(H3) need neighbourhoods of 
both the points simultaneously, it is sufficient for X to be Ti for the conclusions to remain valid. 

Prom its definition it follows that any totally disconnected space is a Hausdorff space and 
is therefore both Ti and Tq spaces as well. However, if a Hausdorff space has a base of clopen 
sets then it is totally disconnected; this is so because if x and y are distinct points of X, 
then the assumed property oi x € H <^ M for every M £ Mx and some clopen set M yields 
X = Hy_}[X — H) as a disconnection of X that separates x and y £ X — H; note that the 
assumed Hausdorffness of X allows M to be chosen so as not to contain y. 

Example A3.1. (1) Every indiscrete space is connected; every subset of an indiscrete space is 
connected. Hence if X is empty or a singleton, it is connected. A discrete space is connected 
iff it is either empty or is a singleton; the only connected subsets in a discrete space are the 
degenerate ones. This is an extreme case of lack of connectedness, and a discrete space is the 
simplest example of a total disconnected space. 
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/ 
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X 


74-inclusion /exclusion 


X 


X 


X 



Table 7: Separation properties of some useful spaces. 



(2) Q, the set of rationals considered as a subspace of the real line, is (totally) disconnected 
because all rationals larger than a given irrational r is a clopen set in Q, and 

Q = ((—00, r) Pi Q) UC'Q n(^) ^)) ''' is irrational 

is the union of two disjoint clopen sets in the relative topology of Q. The sets (— oo,r) n Q 
and Q H (r, cxd) are clopen in Q because neither contains a cluster point of the other. Thus for 
example, any neighbourhood of the second must contain the irrational r in order to be able to 
cut the first which means that any neighbourhood of a point in either of the relatively open sets 
cannot be wholly contained in the other. The only connected sets of Q are one point subsets 
consisting of the individual rationals. In fact, a connected piece of Q, being a connected subset 
of M, is an interval in M, and a nonempty interval cannot be contained in Q unless it is a 
singleton. It needs to be noted that the individual points of the rational line are not (cl)open 
because any open subset of M that contains a rational must also contain others different from it. 
This example shows that a space need not be discrete for each of its points to be a component 
and thereby for the space to be totally disconnected. 

In a similar fashion, the set of irrationals is (totally) disconnected because all the irrationals 
larger than a given rational is an example of a clopen set in R — Q. 

(3) The p-inclusion (A-inclusion) topology is connected; a subset in this topology is connected 
iff it is degenerate or contains p. For, a subset inherits the discrete topology if it does not contain 
p, and p-inclusion topology if it contains p. 

(4) The cofinite (cocountable) topology on an infinite (uncountable) space is connected; a 
subset in a cofinite (cocountable) space is connected iff it is degenerate or infinite (countable). 

(5) Removal of a single point may render a connected space disconnected and even totally 
disconnected. In the former case, the point removed is called a cut point and in the second, it 
is a dispersion point. Any real number is a cut point of M and it does not have any dispersion 
point only. 

(6) Let X be a topological space. Considering components of X as equivalence classes by 
the equivalence relation ~ with Q : X — > X/ ~ denoting the quotient map, X/ ~ is totally 
disconnected: As (5~([x]) is connected for each [x] £ X/ ~ in a component class of X, and as 
any open or closed subset A C X/ ~ is connected iff Q~{A) is open or closed, it must follow 
that A can only be a singleton. ■ 

The next notion of compactness in topological spaces provides an insight of the role of 
nonempty adherent sets of filters that lead in a natural fashion to the concept of attractors in 
the dynamical systems theory that we take up next. 

Definition A3. 4. Compactness. A topological space X is compact iff every open cover of X 
contains a Enite suhcover of X. □ 
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This definition of compactness has an useful equivalent contrapositive reformulation: For any 
given collection of open sets of X if none of its finite subcoUections cover X, then the entire 
collection also cannot cover X. The following theorem is a statement of the fundamental prop- 
erty of compact spaces in terms of adherences of filters in such spaces, the proof of which uses 
this contrapositive characterization of compactness. 

Theorem A3.1. A topological space X is compact iff each class of closed subsets of X with 
finite intersection property has nonempty intersection. □ 

Proof. Necessity. Let X be a compact space. Let = {-FojogD be a collection of closed 
subsets of X with finite FIP, and let Q = {X — Fa}a£0 be the corresponding open sets of X. 
If {Gi}^i is a nonempty finite subcollection from Q, then {X — Gi}^^ is the corresponding 
nonempty finite subcollection of J^. Hence from the assumed finite intersection property of 
it must be true that 

N N 

X-\jGi = f]{X-G^) (DeMorgan's Law) 

1=1 1=1 

+ 0, 

so that no finite subcollection of Q can cover X. Compactness of X now implies that Q too 
cannot cover X and therefore 

f|F„ = f|(X - G„) = X - |JG„ / 0. 

OL ct a 

The proof of the converse is a simple exercise of reversing the arguments involving the two 
equations in the proof above. ■ 

Our interest in this theorem and its proof lies in the following corollary — which essentially 
means that for every filter T on a compact space the adherent set adh(.7-") is not empty — from 
which follows that every net in a compact space must have a convergent subnet. 

Corollary. A space X is compact iff for every class A = {A^) of nonempty subsets of X with 
FlF,adHA) = f]A^^^Cl{A^)^(!}. □ 

The proof of this result for nets given by the next theorem illustrates the general approach in 
such cases which is all that is basically needed in dealing with attr actors of dynamical systems; 
compare Theorem Al.3. 

Theorem A3. 2. A topological space X is compact iff each net in X adheres in X. □ 
Proof. Necessity. Let X be a compact space, x-^ ^ ^ Si net in X, and Mq the residual of a 
in the directed set B. For the filter-base (Fi3^(iR„))ogD of nonempty, decreasing, nested subsets 
of X associated with the net X) compactness of X requires from C\a-<s ^Kx(Ma) ^ x(Ms) / 0, 
that the uncountably intersecting subset 

adh(F^^) := fl Cl(x(M,)) 

of X be non-empty. If x G adh(F'S^) then because x is in the closure of x(IR/3), it follows from 
Eq. ^ that iVfl / for every N G A4, /3 G B. Hence x{l) ^ ^ for some 7 ^ /3 so 

that X G adh(x); see Eq. (|88|). 

Sufficiency. Let x be a net in X that adheres at x G X. From any class J- of closed subsets 
of X with FIP, construct as in the proof of Thm. A1.4, a decreasing nested sequence of closed 
subsets Gp = r\a-<i3ei]>{'^a '■ Fa S -^l and consider the directed set j^Gp = {{Gp,P) : (/? G 
B)(x^ G Gjs)} with its natural direction H83() to define the net xiC/3,/3) = xp in X] see Def. 
Al.lO. From the assumed adherence of x at some x G X, it follows that N {^F ^ ^ for every 

^''This is of course a triviality if wc identify each x(!^0) (or F in the proof of the converse that follows) 
with a neighbourhood N oi X that generates a topology on X. 
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N £ Mx and F £ T. Hence x belongs to the closed set F so that x G adh(^); see Eq. (|^^ . 
Hence X is compact. ■ 

Using Theorem Al.5 that specifies a definite criterion for the adherence of a net, this theorem 
reduces to the useful formulation that a space is compact iff each net in it has some convergent 
subnet. An important application is the following: Since every decreasing sequence (Fm) of 
nonempty sets has FIP (because nm=i = F]\i for every finite M), every decreasing sequence 
of nonempty closed subsets of a compact space has nonempty intersection. For a complete metric 
space this is known as the Nested Set Theorem, and for [0, 1] and other compact subspaces of 
M as the Cantor Intersection Theorem.^^ 

For subspaces j4 of X, it is the relative topology that determines as usual compactness of A; 
however the following criterion renders this test in terms of the relative topology unnecessary 
and shows that the topology of X itself is sufficient to determine compactness of subspaces: A 
suhspace K of a topological space X is compact iff each open cover of K in X contains a finite 
cover of K. 

A proper understanding of the distinction between compactness and closedness of subspaces 
— which often causes much confusion to the non-specialist — is expressed in the next two 
theorems. As a motivation for the first that establishes that not every subset of a compact 
space need be compact, mention may be made of the subset (a, h) of the compact closed interval 
[a, h] in M. 

Theorem A3. 3. A closed subset F of a compact space X is compact. □ 
Proof. Let Q be an open cover of F so that an open cover of X is ^ |J {X — F) , which because 
of compactness of X contains a finite subcover U. Then U — (X — F) is a finite collection of Q 
that covers F. ■ 

It is not true in general that a compact subset of a space is necessarily closed. For example, 
in an infinite set X with the cofinite topology, let F be an infinite subset of X with X — F also 
infinite. Then although F is not closed in X, it is nevertheless compact because X is compact. 
Indeed, let Q be an open cover of X and choose any nonempty Gq (z G. If Go = ^ then 
{Go} is the required finite cover of X. If this is not the case, then because X — Gq = {xj}"^^ 
is a finite set, there is a Gj € ^ with Xi G Gi for each 1 < i < n, and therefore {Gjj^^Q is 
the finite cover that demonstrates the compactness of the cofinite space X. Compactness of F 
now follows because the subspace topology on F is the induced cofinite topology from X. The 
distinguishing feature of this topology is that it, like the cocountable, is not Hausdorff: If U 
and V are any two nonempty open sets of X, then they cannot be disjoint as the complements 
of the open sets can only be finite and if C/ P| F were to be indeed empty, then 

X = X -(ll = X -{Uf]V) = {X -U) \J{X - V) 

would be a finite set. An immediate fallout of this is that in an infinite cofinite space, a 
sequence (xi)igM (and even a net) with Xi ^ Xj for i ^ j behaves in an extremely unusual way: 
It converges, as in the indiscrete space, to every point of the space. Indeed if x € X, where X is 
an infinite set provided with its cofinite topology, and U is any neighbourhood of x, any infinite 
sequence (xi)igN in X must be eventually in U because X — U is finite, and ignoring of the 
initial set of its values lying in X — [/ in no way alters the ultimate behaviour of the sequence 
(note that this implies that the filter induced on X by the sequence agrees with its topology). 

^■'Nested-set theorem. If (En) is a decreasing sequence of nonempty, closed, subsets of a complete 
metric space {X, d) such that lini„_,oo dia(£'„) = 0, then there is a unique point 

oc 

xe f] En. 

n=0 

The uniqueness arises because the limiting condition on the diameters of En imply, from property (HI), 
that {X, d) is a Hausdorff space. 
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Thus — > X for any x G X is a reflection of the fact that there are no smah neighbourhoods 
of any point of X with every neighbourhood being almost the whole of X, except for a null set 
consisting of only a finite number of points. This is in sharp contrast with Hausdorff spaces 
where, although every finite set is also closed, every point has arbitrarily small neighbourhoods 
that lead to unique limits of sequences. A corresponding result for cocountable spaces can be 
found in Example A1.2 Continued. 

This example of the cofinite topology motivates the following "converse" of the previous 
theorem. 

Theorem A3. 4. Every compact subspace of a Hausdorff space is closed. □ 




Figure 23: Closedness of compact subsets of a Hausdorff space. 



Proof. Let K he a nonempty compact subset of X, Fig. \2'A[ and \et x £ X — K. Because of the 
separation of X, for every y £ K there are disjoint open subsets Uy and Vy of X with y £ Uy, 
and X G Vy. Hence {Uy}y^K is an open cover for K, and from its compactness there is a finite 
subset A of K such that K C IJ^g^ Uy with V = flygA open neighbourhood of x; V is 

open because each is a neighbourhood of x and the intersection is over finitely many points 
y of A. To prove that K is closed in X it is enough to show that V is disjoint from K: If there 
is indeed some z £ V K then z must be in some Uy for y £ A. But as z € ^ it is also in Vy 
which is impossible as Uy and Vy are to be disjoint. This last part of the argument infact shows 
that if K is a compact suhspace of a Hausdorff space X and x ^ K, then there are disjoint open 
sets U and V of X containing x and K. ■ 

The last two theorems may be combined to give the obviously important 

Corollary. In a compact Hausdorff space, closedness and compactness of its subsets are equiv- 
alent concepts. □ 

In the absence of Hausdorffness, it is not possible to conclude from the assumed compactness 
of the space that every point to which the net may converge actually belongs to the subspace. 

Definition A3. 5. A subset D of a topological space {X,U) is dense in X if Cl{D) = X . Thus 
the closure of D is the largest open subset of X, and every neighbourhood of any point of X 
contains a point of D not necessarily distinct from it; refer to the distinction between Eqs. \2fl\) 
and □ 

Loosely, D is dense in X iff every point of X has points of D arbitrarily close to it. A 
self-dense {dense in itself) set is a set without any isolated points; hence A is self-dense iff 
A C Der(A). A closed self-dense set is called a perfect set so that a closed set A is perfect iff it 
has no isolated points. Accordingly 

A is perfect <^=^ A = Der(^), 
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means that the closure of a set without any isolated points is a perfect set. 

Theorem A3. 5. The following are equivalent statements. 

(1) D is dense in X. 

(2) If F is any closed set of X with D Q F, then F = X; thus the only closed superset of D 
is X. 

(3) Every nonempty (basic) open set of X cuts D; thus the only open set disjoint from D is 
the empty set 0. 

(4) The exterior of D is empty. □ 

Proof. (3) If U indeed is a nonempty open set of X with [/ P| = 0, then D (1 X — U ^ X 
leads to the contradiction X = Cl(-D) C C1(X — U) = X — U ^ X , which also incidentally 
proves (2). From (3) it follows that for any open set U of X, C\{U) = C\{U P| D) because if V 
is any open neighbourhood of x E C\{U) then V {^U is a nonempty open set of X that must 
cut D so that Fn(^n^) / implies x € C\{Uf]D). Finally, Cl{Ur\D) C Cl(?7) completes 
the proof. ■ 

Definition A3. 6. (a) A set A <^ X is said to be nowhere dense in X if Int(Cl(A)) = and 
residual in X if Int(A) =0. □ 

A is nowhere dense in X iff 

Bdy{X - Cl{A)) = Bdy(Cl(A)) = Cl{A) 

so that 

Cl{X - Cl{A)) = {X - Cl{A)) U Cl{A) = X 
from which it follows that 

A is nwd in X X — Cl{A) is dense in X 

and 

A is residual in X <^=^ X — A is dense in X. 

Thus A is nowhere dense iff Ext (A) := X — Cl{A) is dense in X, and in particular a closed 
set is nowhere dense in X iff its complement is open dense in X with open-denseness being 
compliment arily dual to closed-nowhere denseness. The rationals in reals is an example of a set 
that is residual but not nowhere dense. The following are readily verifiable properties of subsets 
of X. 

(1) A set A <^ X is nowhere dense in X iff it is contained in its own boundary, iff it is 
contained in the closure of the complement of its closure, that is A C Cl{X — Cl{A)). In 
particular a closed subset A is nowhere dense in X iff ^ = Bdy(^), that is iff it contains no 
open set. 

(2) From M CN ^ C1(M) C Cl(iV) it follows, with M = X- Cl{A) and N = X - A, that 
a nowhere dense set is residual, but a residual set need not be nowhere dense unless it is also 
closed in X. 

(3) Since C1(C1(^)) = Cl{A), Cl{A) is nowhere dense in X iff ^ is. 

(4) For any A C X, both Bdy^(X-A) := Cl{X-A)f]AandBdyx-A{A) ■= Cl(^)n(^-^) 
are residual sets and as Fig. |22l shows 

Bdy^(^) = Bdy^„^(^) [J Bdy^X - A) 

is the union of these two residual sets. When A is closed (or open) in X, its boundary consisting 
of the only component Bdy^{X — A) (or Bdjx-Ai^)) shown by the second row (or column) 
of the figure, being a closed set of X is also nowhere dense in X; infact a closed nowhere dense 
set is always the boundary of some open set. Otherwise, the boundary components of the two 
residual parts — as in the donor-donor, donor-neutral, neutral-donor and neutral-neutral cases 
— need not be individually closed in X (although their union is) and their union is a residual 
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set that need not be nowhere dense in X: the union of two nowhere dense sets is nowhere 
dense but the union of a residual and a nowhere dense set is a residual set. One way in which 
a two-component boundary can be nowhere dense is by having Bdy^(X — A) ^ Der(^) or 
Bdyjf_^(A) 5 Der(X — A), so that it is effectively in one piece rather than in two, as shown in 
Fig. I21b). 





(a) A is isolated 
Der(A) = 
Der(e(A)) C e(^) 



(b) A is nwd 





(c) Bdy(A) is nwd (d) A is Cantor 



Bdy(e(Cl(A))) = Cl(^) Der(A) = Bdy(A) Der(A) = A 



Bdy(Cl(A)) 



Der(e(A)) 



Bdy(e(^)) 



Figure 24: Shows the distinction between isolated, nowhere dense and Cantor sets. Topologically, the 
Cantor set can be described as a perfect, nowhere dense, totally disconnected and compact subset of 
a space. In (b), the closed nowhere dense set Cl(^) is the boundary of its open complement. Here 
downward and upward inclined hatching denote respectively Bdy^(X — A) and Bdy_Y_yi(A). 

Theorem A3. 6. A is nowhere dense in X iff each, non-empty open set of X has a non-empty 
open subset disjoint from Cl{A). □ 

Proof. If [/ is a nonempty open set of X, then Uq = U nExt(A) 7^ as Ext(A) is dense in X; Uq 
is the open subset that is disjoint from Cl{A). It clearly follows from this that each non-empty 
open set of X has a non-empty open subset disjoint from a nowhere dense set A. ■ 

What this result (which follows just from the definition of nowhere dense sets) actually 
means is that no point in Bdyjf_^(j4) can be isolated in it. 

Corollary. A is nowhere dense in X iff Cl{A) does not contain any nonempty open set of X 
iff any nonempty open set that contains A also contains its closure. □ 

Example A3. 2. Each finite subset of is nowhere dense in M""; the set {l/n}^^i is nowhere 
dense in M. The Cantor set C is nowhere dense in [0, 1] because every neighbourhood of any 
point in C must contain, by its very construction, a point with 1 in its ternary representation. 
That the interior and the interior of the closure of a set are not necessarily the same is seen in 
the example of the rationals in reals: The set of rational numbers Q has empty interior because 
any neighbourhood of a rational number contains irrational numbers (so also is the case for 
irrational numbers) and ffi = Int(Cl(Q)) 5 Int(Q) = justifies the notion of a nowhere dense 
set. ■ 

The following properties of C can be taken to define any subset of a topological space as a 
Cantor set; set-theoretically it should be clear from its classical middle-third construction that 
the Cantor set consists of all points of the closed interval [0, 1] whose infinite triadic (base 3) 
representation, expressed so as not to terminate with an infinite string of I's, does not contain 
the digit 1. Accordingly, any end-point of the infinite set of closed intervals whose intersection 
yields the Cantor set, is represented by a repeating string of either or 2 while a non end-point 
has every other arbitrary collection of these two digits. Recalling that any number in [0, 1] 
is a rational iff its representation in any base is terminating or recurring — thus any decimal 
that neither repeats or terminates but consists of all possible sequences of all possible digits 
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represents an irrational number — it follows that both rationals and irrationals belong to the 
Cantor set. 



• — 

(1) 

1 






^(1) 


^(1) 

X3 






^(1) 


(2) 
1 


^(2) 




^(2) 


• — 

(2) 
X5 


^(2) 
Xg 


2.(2) 


^(2) 
Xg 


(3) 
1 

•— • 


^(3) 


^(3) 


^(3) 
•-• 


^(3) 

Xg 

•-• 


^(3) 
«-•■■■ 


^(3) 


^(3) 
•-• 




Figure 25: Construction of the classical f/3-Cantor set. The endpoints of C3, for example, in increasing 
order are: |0, ^| ; ; ||, ^| ; 3I ; ||, ^| ; Iff, || ; |i> if | ; |if C'^ is the union of 2* pairwise 

disjoint closed intervals each of length 3^* and the nonempty infinite intersection C = H^gCi is the 
adherent Cantor set of the filter-base of closed sets {Cq, Ci, C2, • • • }• 

(CI) C is totally disconnected. If possible, let C have a component containing points a 
and b with a < b. Then [a, b] Q C ^ [a, b] C d for all i. But this is impossible because we may 
choose i large enough to have 3~' < 6 — a so that a and b must belong to two different members 
of the pairwise disjoint closed 2* subintervals each of length 3^* that constitutes Ci. Hence 

[a, b] is not a subset of any Ci =^ [a, b] is not a subset of C. 

(C2) C is perfect so that for any x £ C every neighbourhood of x must contain some other 
point of C. Supposing to the contrary that the singleton {x} is an open set of C, there must be 
an e > such that in the usual topology of M 

{x} =Cf]{x-e,x + e). (124) 

Choose a positive integer i large enough to satisfy 3~* < e. Since x is in every Ci, it must be 
in one of the 2* pairwise disjoint closed intervals [a, b] C {x — e,x + e) each of length 3~* whose 
union is Cj. As [a, b] is an interval, at least one of the endpoints of [a, b] is different from x, and 
since an endpoint belongs to C, Cr\{x — e,x + s) must also contain this point thereby violating 
Eq. (UMl). 

(C3) C is nowhere dense because each neighbourhood of any point of C intersects Ext(C); 
see Thm. A3. 6. 

(C4) C is compact because it is a closed subset contained in the compact subspace [0, 1] 
of M, see Thm. A3. 3. The compactness of [0, 1] follows from the Heine-Borel Theorem which 
states that any subset of the real line is compact iff it is both closed and bounded with respect 
to the Euclidean metric on M. 

Compare (CI) and (C2) with the essentially similar arguments of Example A3. 1(2) for the 
subspace of rationals in M. 



A4. Neutron Transport Theory 

This section introduces the reader to the basics of the linear neutron transport theory where 
graphical convergence approximations to the singular distributions, interpreted here as multi- 
functions, led to the present study of this work. The one-speed (that is mono-energetic) neutron 
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transport equation in one dimension and plane geometry, is 



):^ + ^{x,fl) = - J 0<c<l, -l</i<l (125) 



dx 



where x is a non-dimensional physical space variable that denotes the location of the neutron 
moving in a direction 6 = cos~^(//), ^{x, fi) is a neutron density distribution function such that 
^{x, iJ,)dxdiJ, is the expected number of neutrons in a distance dx about the point x moving at 
constant speed with their direction cosines of motion in dfi about fi, and c is a physical constant 
that will be taken to satisfy the restriction shown above. Case's method starts by assuming the 
solution to be of the form ^,y{x,n) = e~^^^(p{^,i') with a normalization integral constraint of 
X^i J^)dfi = 1 to lead to the simple equation 

{u-fimfi,u) = - (126) 

for the unknown function (j){v,ix). Case then suggested, see Case and Zweifel (|l967l 'l. the non- 
simple complete solution of this equation to be 

cu 1 

4>{n,v) = —V^ + X{v)5{v-ij), (127) 

where A(i/) is the usual combination coefficient of the solutions of the homogeneous and non- 
homogeneous parts of a linear equation, V{-) is a principal value and 5{x) the Dirac delta, to 
lead to the full-range — 1 < < 1 solution valid for — oo < x < oo 

$(x, ^i) = a(i/o)e-^/^«(/'(/x, i/o) + a(-i^o)e^/^«(/)(-i/o, ^A + t a(z.)e-^/X/i, v)dv (128) 



of the one-speed neutron transport equation H125() . Here the real vq and u satisfy respectively 
the integral constraints 

CI/Q , t^O + 1 ^ 11^1 

-^In 7 = 1' t'o > 1 

2 f - 1 

A(i/) = l-yln^, z^G[-l,l], 



with 



2 uq- n 



following from Eq. 

It can be shown ICase and ZweifeJ ( 1967 ) that the eigenfunctions (f>{i', //) satisfy the full-range 
orthogonality condition 

1 

/U(/)(z^, , n)dfi = N{i')5{u — u'), 

1 

where the odd normalization constants N are given by 

Af(±z/o) = j fxcf)^ {±1^0, fM)dfj, for | fo |> 1 



= ±- 

and 



Ch'i 



3 



for G [-1,1]. 
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With a source of particles 'ip{xQ,fi) located at x = xq in an infinite medium, Eq. ()128|1 reduces 
to the boundary condition, with fi, u £ [—1, 1], 



for the determination of the expansion coefficients a(ibi'o)i {^(^)}i/e[-i,i] • Use of the above 
orthogonality integrals then lead to the complete solution of the problem to be 



a{u) 



N{u) 



fi'ip{xo, fi)cl){^, i')dfi, V = ibfo or V £ [—1,1]- 



For example, in the infinite-medium Greens function problem with xq = and '^{xq,\i) 
5{n — Ho)/n, the coefficients are a{±i^Q) = (j){fiQ,±UQ)/N{±UQ) when v = ±1^0, and a(i/) 
(j){^io,i^)/N{u) for [-1,1]. 

For a half-space < x < co, the obvious reduction of Eq. (|128j) to 



Jo 

with boundary condition, H-, v £ [0, 1], 

^(xo, /x) = a(i/o)e-"°/'^°0(M, ^o) + C a(z/)e-"°/XM, v)dv, 



(130) 



(131) 



leads to an infinitely more difficult determination of the expansion coefficients due to the more 
involved nature of the orth ogonality relations of the eigenfunctions in the half- interval [0, 1] that 
now reads for v, u' G [0, 1] ICase and ZweifeJ ( 1967 ) 



W{fi)(j){^i,u')(j){i^,u)di^ 
W{fi)(l){n,iJo)(l){ij,,ij)dij, 

W{n)(p{fi, ±z^o)0(/-f, J^o)c?M 

1 

W{n)cl){fi, uo)cl){fi, -u)dn 
W{n)ct){fi,u')(t){fi,-u)dn 



W{u)N{u] 



5{u - v') 



V 2 / 



(132) 



cv 



— (yQ^v)(\){v',-v)X{-v) 



where the half-range weight function M^(^) is defined as 



2(l-c)(z.o + /i)^(-/x) 



(133) 



in terms of the X-function 



X(-^) = exp-|| / 



1 + 



ln(z^ -|- [i)dv 



< < 1, 



l-z/2 

that is conveniently obtained from a numerical solution of the nonlinear integral equation 

1/2(1 -C)-Z.2 



0(-/i) = 1 



2( 



i-c)yo K 



dv 



(134) 



101 



to yield 

^J■ + vi - c 

and the X(ibi/o) satisfy 

Z.2(l-c)-l 



X{uo)X{-uo) 



2{1 - c)viiui - 1) 

Two other useful relations involving the VT-function are given by W{fi)(l){fj,,i'Q)dfi = ci'o/2 
and W{fi)(f){fi, h')dfj, = cu/2. 

The utility of these full and half range orthogonality relations lie in the fact that a suitable 
class of functions of the type that is involved here can always be expanded in terms of them, see 
Case and Zweifell ( 19671 ) . An example of this for a full-range problem has been given above; we 



end this introduction to the generalized — traditionally known as singular in neutron transport 
theory — eigenfunction method with two examples of half-range orthogonality integrals to the 
half-space problems A and B of Sec. 5. 

Problem A: The Milne Problem. In this case there is no incident flux of particles from 
outside the medium at x = 0, but for large x > the neutron distribution inside the medium 
behaves like e^^'^''(f){—i'o, fj,). Hence the boundary condition (|131|1 at x = reduces to 

-(f>{fJ', -z^o) = aA{j^o)(f>{fJ', t'o) + / aA{u)(j){^i, u)dv ^ > 0. 

JO 

Use of the fourth and third equations of Eq. (|132|) and the explicit relation Eq. (|1.S3|) for W{^) 
gives respectively the coefficients 

aA(fo) = X{-uq)/X{vq) 

«aH = -j^c{l-c)ulvX{-v^)X{-u) (135) 

The extrapolated end-point zq of Eq. l|H7|) is related to aA(i^o) of the Milne problem by ap^iyo) = 
-exp(-2zo/fo). 

Problem B: The Constant Source Problem. Here the boundary condition at x = is 

1 = aB(i^o)</'(/U,i^o) + / aB{v)(t){^,v)du > 

Jo 

which leads, using the integral relations satisfied by W , to the expansion coefficients 

aB(z^o) = -2/cu(,X{vo) (136) 
= j^{l-c)v{vQ + v)X{-v) 

where the X{±.vq) are related to Problem A as 

X[vo) = - 



-0^(1 



X{-vo) 



2aA(z^o)(l-c)(z^o -1) 

Ua{vo) (z^g(l-c)-l) 
2{l-c){vl-l) ■ 



This brief introduction to the singular eigenfunction method should convince the reader 
of the great difficulties associated with half-space, half-range methods in particle transport 
theory; note that the X-functions in the coefficients above must be obtained from numerically 
computed tables. In contrast, full-range methods are more direct due to the simplicity of the 
weight function which suggests the full-range formulation of half-range problems presented 
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in Sec. 5. Finally it should be mentioned that this singular eigenfunction method is based on 
the theory of singular integral equations. 
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